RID: 2G3KCYK3013
Job Title: POLN_MAYAB Q8QZ73 Polyprotein P1234 (P1234)...
Program: BLASTP
Query: POLN_MAYAB Q8QZ73 Polyprotein P1234 (P1234) (Non-structural polyprotein) (Polyprotein P123') (P123') (Polyprotein P123) (P123) (mRNA-capping enzyme nsP1) (2.1.1.- {ECO:0000250|UniProtKB:P27282}) (2.7.7.- {ECO:0000250|UniProtKB:P03317}) (Non-structural protein 1) (Protease nsP2) (3.4.22.- {ECO:0000250|UniProtKB:Q8JUX6}) (3.6.1.15 {ECO:0000250|UniProtKB:Q8JUX6}) (3.6.1.74 {ECO:0000250|UniProtKB:P08411}) (3.6.4.13 {ECO:0000250|UniProtKB:Q8JUX6}) (Non-structural protein 2) (nsP2) (Non-structural protein 3') (nsP3') (3.1.3.84 {ECO:0000305}) (Non-structural protein 3) (nsP3) (3.1.3.84 {ECO:0000250|UniProtKB:Q8JUX6}) (RNA-directed RNA polymerase nsP4) (2.7.7.19 {ECO:0000250|UniProtKB:P03317}) (2.7.7.48 {ECO:0000255|PROSITE-ProRule:PRU00539}) (Non-structural protein 4) (nsP4)
ID: lcl|Query_1956434(amino acid)
Length: 492
Database: swissprot Non-redundant UniProtKB/SwissProt sequences

Sequences producing significant alignments:

Description  Scientific Name  Common Name  Taxid  Max Score  Total Score  Query cover  E Value  Per. Ident  Acc. Len  Accession
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...  Mayaro virus...  NA  374990  1018  1018  100%  0.0  100.00  2437  Q8QZ73.3
RecName: Full=Polyprotein nsP1234; Short=P1234; AltName:...  Ross river v...  NA  11032  514  514  100%  9e-171  52.01  1149  P13888.2
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...  Semliki Fore...  NA  11033  526  526  100%  1e-167  55.98  2432  P08411.2
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...  Getah virus  NA  59300  512  512  100%  2e-162  52.38  2467  Q5Y389.3
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...  Sagiyama virus  NA  59303  511  511  100%  4e-162  51.56  2467  Q9JGL0.3
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...  Ross river v...  NA  11031  500  500  96%  3e-158  52.20  2480  P13887.2
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...  Chikungunya ...  NA  371094  498  498  100%  1e-157  49.44  2474  Q8JUX6.1
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...  Chikungunya ...  NA  371095  491  491  98%  4e-155  50.60  2474  Q5XXP4.1
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...  Igbo Ora virus  NA  79899  467  467  66%  2e-146  65.12  2513  O90370.1
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...  O'nyong-nyon...  NA  11028  465  465  66%  7e-146  64.81  2514  P13886.2
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...  O'nyong-nyon...  NA  374989  463  463  66%  2e-145  64.51  2513  O90368.1
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...  Barmah Fores...  NA  11020  421  421  100%  2e-130  48.30  2411  P87515.3
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...  Eastern equi...  NA  374597  387  387  66%  8e-119  56.31  2474  Q306W8.3
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...  Eastern equi...  NA  374596  385  385  66%  3e-118  55.35  2471  Q306W6.3
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...  Eastern equi...  NA  374598  383  383  67%  2e-117  54.35  2494  Q4QXJ8.3
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...  Venezuelan e...  NA  11036  381  381  66%  1e-116  55.76  2497  Q8V294.3
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...  Venezuelan e...  NA  36384  380  380  66%  2e-116  55.42  2499  Q9WJC7.3
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...  Venezuelan e...  NA  36385  379  379  66%  4e-116  55.15  2493  P36328.2
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...  Venezuelan e...  NA  36382  378  378  90%  1e-115  45.76  2485  P36327.3
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...  Venezuelan e...  NA  11038  370  370  90%  5e-113  44.68  2493  P27282.3
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...  Sindbis virus  NA  11034  366  366  82%  2e-111  46.00  2513  P03317.2
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...  Aura virus  NA  44158  365  365  76%  5e-111  51.16  2499  Q86924.3
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...  Ockelbo virus  NA  31699  363  363  82%  1e-110  45.18  2515  P27283.2
RecName: Full=Polyprotein nsP1234; Short=P1234; AltName:...  Middelburg v...  NA  11023  338  338  84%  5e-105  48.43  995  P03318.2
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...  Sleeping dis...  NA  78540  234  234  66%  6e-66  40.70  2593  Q8QL53.1
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...  Salmon pancr...  NA  84589  229  229  62%  2e-64  41.28  2601  Q8JJX1.1
RecName: Full=Uncharacterized protein Saci_1252 [Sulfolobus...  Sulfolobus a...  NA  330779  62.0  62.0  22%  5e-10  34.43  181  Q4J9D2.1
RecName: Full=Uncharacterized protein SSO2899 [Saccharolobus...  Saccharolobu...  NA  273057  61.2  61.2  31%  8e-10  27.98  177  Q97UU4.1
RecName: Full=Uncharacterized protein STK_23830 [Sulfurisphaer...  Sulfurisphae...  NA  273063  57.8  57.8  22%  2e-08  34.17  182  Q96XY5.1
RecName: Full=Uncharacterized protein PAE1111 [Pyrobaculum...  Pyrobaculum ...  NA  178306  55.1  55.1  22%  1e-07  33.33  182  Q8ZXT3.1
RecName: Full=Uncharacterized protein FN1951 [Fusobacterium...  Fusobacteriu...  NA  190304  53.9  53.9  20%  2e-07  33.02  175  Q8RHQ2.1
RecName: Full=Uncharacterized protein TM_0508 [Thermotoga...  Thermotoga m...  NA  243274  56.6  56.6  23%  3e-07  30.71  599  Q9WYX8.1
RecName: Full=Macro domain-containing protein DR_2288...  Deinococcus ...  NA  243230  52.0  52.0  23%  1e-06  37.50  170  Q9RS39.1
RecName: Full=Macro domain-containing protein MA_1614...  Methanosarci...  NA  188937  50.8  50.8  30%  4e-06  29.56  195  Q8TQD0.1
RecName: Full=Macro domain-containing protein RSc0334 [Ralston...  Ralstonia ps...  NA  267608  49.7  49.7  22%  8e-06  33.62  171  Q8Y2K1.1
RecName: Full=Macro domain-containing protein VPA0103 [Vibrio...  Vibrio parah...  NA  223926  48.1  48.1  23%  2e-05  30.95  170  Q87JZ5.1
RecName: Full=Macro domain-containing protein MM_0177...  Methanosarci...  NA  192952  48.1  48.1  30%  3e-05  30.82  187  Q8Q0F9.1
RecName: Full=Macro domain-containing protein PG1779...  Porphyromona...  NA  242619  47.4  47.4  23%  4e-05  31.71  164  Q7MTZ7.1
RecName: Full=O-acetyl-ADP-ribose deacetylase 1; AltName:...  Pantoea vaga...  NA  712898  46.6  46.6  28%  9e-05  30.13  171  E1SDF1.1
RecName: Full=Macro domain-containing protein CT2219...  Chlorobaculu...  NA  194439  46.2  46.2  22%  1e-04  33.61  172  Q8KAE4.1
RecName: Full=Non-structural polyprotein p200; Short=p200;...  Rubella viru...  NA  376267  48.1  48.1  25%  2e-04  35.56  2116  Q8BCR0.1
RecName: Full=ADP-ribose glycohydrolase MACROD2; AltName:...  Homo sapiens  human  9606  47.0  47.0  22%  2e-04  33.33  425  A1Z1Q3.2
RecName: Full=O-acetyl-ADP-ribose deacetylase 2; AltName:...  Pantoea vaga...  NA  712898  44.7  44.7  28%  3e-04  28.21  171  E1PL40.1
RecName: Full=Non-structural polyprotein p200; Short=p200;...  Rubella viru...  NA  376264  47.0  47.0  20%  4e-04  36.45  2116  Q99IE5.1
RecName: Full=Non-structural polyprotein p200; Short=p200;...  Rubella viru...  NA  376265  46.6  46.6  20%  4e-04  36.45  2116  Q99IE7.1
RecName: Full=Non-structural polyprotein p200; Short=p200;...  Rubella viru...  NA  11045  46.6  46.6  20%  5e-04  36.45  2116  P13889.5
RecName: Full=ADP-ribose glycohydrolase MACROD2; AltName:...  Mus musculus  house mouse  10090  46.2  46.2  22%  5e-04  32.48  475  Q3UYG8.1
RecName: Full=Non-structural polyprotein p200; Short=p200;...  Rubella viru...  NA  376266  46.2  46.2  20%  6e-04  36.45  2116  Q9J6K9.2
RecName: Full=Macro domain-containing protein in non 5'region;...  Streptomyces...  NA  1911  43.9  43.9  23%  8e-04  31.30  177  Q9KHE2.1
RecName: Full=Macro domain-containing protein LA_4133...  Leptospira i...  NA  189518  43.5  43.5  28%  0.001  29.22  175  Q8EYT0.1
RecName: Full=Macro domain-containing protein LIC_13295...  Leptospira i...  NA  267671  43.5  43.5  28%  0.001  29.22  175  Q72M93.1
RecName: Full=ADP-ribose glycohydrolase MACROD1; AltName:...  Homo sapiens  human  9606  44.3  44.3  22%  0.001  30.17  325  Q9BQ69.2
RecName: Full=Uncharacterized protein PH1513 [Pyrococcus...  Pyrococcus h...  NA  70601  43.1  43.1  23%  0.001  24.44  190  O59182.1
RecName: Full=Macro domain-containing protein SCO6450...  Streptomyces...  NA  100226  42.7  42.7  22%  0.002  32.23  169  Q9ZBG3.1
RecName: Full=Non-structural polyprotein p200; Short=p200;...  Rubella viru...  NA  376262  44.7  44.7  20%  0.002  36.45  2116  Q6X2U4.1
RecName: Full=Macro domain-containing protein in sno 5'region;...  Streptomyces...  NA  38314  42.4  42.4  22%  0.002  32.52  181  Q9EYI6.1
RecName: Full=ADP-ribose glycohydrolase MACROD2; AltName:...  Xenopus laevis  African claw...  8355  43.9  43.9  22%  0.002  28.45  418  Q6PAV8.1
RecName: Full=Uncharacterized protein PYRAB06560 [Pyrococcus...  Pyrococcus a...  NA  272844  42.4  42.4  31%  0.003  22.78  183  Q9V0Y3.2
RecName: Full=Macro domain-containing protein PA3693...  Pseudomonas ...  NA  208964  42.4  42.4  22%  0.003  31.36  173  Q9HXU7.1
RecName: Full=Non-structural polyprotein p200; Short=p200;...  Rubella viru...  NA  11043  44.3  44.3  20%  0.003  36.45  2116  Q86500.2
RecName: Full=Non-structural polyprotein p200; Short=p200;...  Rubella viru...  NA  11044  43.9  43.9  25%  0.003  35.56  2116  O40955.1
RecName: Full=Protein mono-ADP-ribosyltransferase PARP9;...  Mus musculus  house mouse  10090  43.5  43.5  22%  0.004  28.57  866  Q8CAS9.2
RecName: Full=ADP-ribose glycohydrolase MACROD1; AltName:...  Mus musculus  house mouse  10090  42.0  42.0  23%  0.007  28.57  323  Q922B1.2
RecName: Full=Non-structural polyprotein p200; Short=p200;...  Rubella viru...  NA  376263  42.4  42.4  20%  0.009  34.58  2116  Q6X2U2.1
RecName: Full=ADP-ribose glycohydrolase MACROD1; AltName:...  Bos taurus  domestic cattle  9913  41.6  41.6  22%  0.010  28.45  325  Q2KHU5.1
RecName: Full=Uncharacterized protein TV0719 [Thermoplasma...  Thermoplasma...  NA  273116  40.4  40.4  22%  0.010  27.56  186  Q97AU0.1
RecName: Full=Replicase polyprotein 1a; Short=pp1a; AltName:...  Human corona...  NA  11137  42.0  42.0  28%  0.012  27.63  4085  P0C6U2.1
RecName: Full=Replicase polyprotein 1ab; Short=pp1ab; AltName:...  Human corona...  NA  11137  42.0  42.0  28%  0.013  27.63  6758  P0C6X1.1
RecName: Full=O-acetyl-ADP-ribose deacetylase; AltName:...  Salmonella e...  NA  454166  40.0  40.0  22%  0.013  32.26  179  B5F961.1
RecName: Full=Macro domain-containing protein TTE0995...  Caldanaeroba...  NA  273068  40.0  40.0  20%  0.014  29.73  175  Q8RB30.1
RecName: Full=Macro domain-containing protein in gbd 3'region;...  Cupriavidus ...  NA  106590  39.7  39.7  23%  0.016  31.50  173  Q44020.1
RecName: Full=Macro domain-containing protein mll7730...  Mesorhizobiu...  NA  266835  40.0  40.0  23%  0.017  32.79  176  Q985D2.1
RecName: Full=O-acetyl-ADP-ribose deacetylase; AltName:...  Salmonella e...  NA  550538  39.7  39.7  22%  0.017  31.45  179  B5RBF3.1
RecName: Full=Uncharacterized protein PF1536 [Pyrococcus...  Pyrococcus f...  NA  186497  39.7  39.7  31%  0.020  22.91  183  Q8U0P9.1
RecName: Full=Protein-ADP-ribose hydrolase; Short=SpyMacroD...  Staphylococc...  NA  93062  40.4  40.4  23%  0.021  30.37  266  Q5HIW9.1
RecName: Full=O-acetyl-ADP-ribose deacetylase; AltName:...  Shigella fle...  NA  373384  39.7  39.7  22%  0.022  32.54  177  Q0T5Z6.1
RecName: Full=ADP-ribose glycohydrolase MACROD1; AltName:...  Rattus norve...  Norway rat  10116  40.0  40.0  22%  0.030  28.45  258  Q8K4G6.2
RecName: Full=Uncharacterized protein Ta1105 [Thermoplasma...  Thermoplasma...  NA  273075  39.3  39.3  24%  0.030  29.23  196  Q9HJ67.2
RecName: Full=Macro domain-containing protein; AltName:...  Acinetobacte...  NA  194927  39.3  39.3  31%  0.031  25.60  183  Q93SX7.1
RecName: Full=Protein-ADP-ribose hydrolase; Short=SpyMacroD...  Staphylococc...  NA  282459  39.7  39.7  23%  0.032  30.37  266  Q6GCE6.1
RecName: Full=O-acetyl-ADP-ribose deacetylase; AltName:...  Salmonella e...  NA  99287  38.9  38.9  22%  0.034  31.45  179  P67341.1
RecName: Full=O-acetyl-ADP-ribose deacetylase; AltName:...  Salmonella e...  NA  423368  38.9  38.9  22%  0.034  31.45  179  B4T2X8.1
RecName: Full=Protein-ADP-ribose hydrolase; Short=SpyMacroD...  Staphylococc...
NA 158878 39.7 39.7 23% 0.034 30.37 266 P67343.1 Alignments: >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Mayaro virus (strain Brazil)] Sequence ID: Q8QZ73.3 Length: 2437 Range 1: 1335 to 1826 Score:1018 bits(2631), Expect:0.0, Method:Compositional matrix adjust., Identities:492/492(100%), Positives:492/492(100%), Gaps:0/492(0%) Query 1 APAYAVKRADIATAIEDAVVNAANHRGQVGDGVCRAVARKWPQAFRNAATPVGTAKTVKC 60 APAYAVKRADIATAIEDAVVNAANHRGQVGDGVCRAVARKWPQAFRNAATPVGTAKTVKC Sbjct 1335 APAYAVKRADIATAIEDAVVNAANHRGQVGDGVCRAVARKWPQAFRNAATPVGTAKTVKC 1394 Query 61 DETYIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFSAGKDR 120 DETYIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFSAGKDR Sbjct 1395 DETYIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFSAGKDR 1454 Query 121 VHQSLSHLLAAMDTTEARVTIYCRDKTWEQKIKTVLQNRSATELVSDELQFEVNLTRVHP 180 VHQSLSHLLAAMDTTEARVTIYCRDKTWEQKIKTVLQNRSATELVSDELQFEVNLTRVHP Sbjct 1455 VHQSLSHLLAAMDTTEARVTIYCRDKTWEQKIKTVLQNRSATELVSDELQFEVNLTRVHP 1514 Query 181 DSSLVGRPGYSTTDGTLYSYMEGTKFHQAALDMAEITTLWPRVQDANEHICLYALGETMD 240 DSSLVGRPGYSTTDGTLYSYMEGTKFHQAALDMAEITTLWPRVQDANEHICLYALGETMD Sbjct 1515 DSSLVGRPGYSTTDGTLYSYMEGTKFHQAALDMAEITTLWPRVQDANEHICLYALGETMD 1574 Query 241 NIRARCPVEDSDSSTPPKTVPCLCRYAMTPERVTRLRMHHTKDFVVCSSFQLPKYRIPGV 300 NIRARCPVEDSDSSTPPKTVPCLCRYAMTPERVTRLRMHHTKDFVVCSSFQLPKYRIPGV Sbjct 1575 NIRARCPVEDSDSSTPPKTVPCLCRYAMTPERVTRLRMHHTKDFVVCSSFQLPKYRIPGV 1634 Query 301 
QRVKCEKVMLFDAAPPASVSPVQYLTNQSETTISLSSFSITSDSSSLSTFPDLESAEELD 360 QRVKCEKVMLFDAAPPASVSPVQYLTNQSETTISLSSFSITSDSSSLSTFPDLESAEELD Sbjct 1635 QRVKCEKVMLFDAAPPASVSPVQYLTNQSETTISLSSFSITSDSSSLSTFPDLESAEELD 1694 Query 361 HDSQSVRPALNEPDDHQPTPTAELATHPVPPPRPNRARRLAAARVQVQVEVHQPPSNQPT 420 HDSQSVRPALNEPDDHQPTPTAELATHPVPPPRPNRARRLAAARVQVQVEVHQPPSNQPT Sbjct 1695 HDSQSVRPALNEPDDHQPTPTAELATHPVPPPRPNRARRLAAARVQVQVEVHQPPSNQPT 1754 Query 421 KPIPAPRTSLRPVPAPRRYVPRPVVELPWPLETIDVEFGAPTEEESDITFGDFSASEWET 480 KPIPAPRTSLRPVPAPRRYVPRPVVELPWPLETIDVEFGAPTEEESDITFGDFSASEWET Sbjct 1755 KPIPAPRTSLRPVPAPRRYVPRPVVELPWPLETIDVEFGAPTEEESDITFGDFSASEWET 1814 Query 481 ISNSSXLGRAGA 492 ISNSSXLGRAGA Sbjct 1815 ISNSSXLGRAGA 1826 >RecName: Full=Polyprotein nsP1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Ross river virus (STRAIN T48)] Sequence ID: P13888.2 Length: 1149 Range 1: 1 to 538 Score:514 bits(1324), Expect:9e-171, Method:Compositional matrix adjust., Identities:284/546(52%), Positives:354/546(64%), Gaps:62/546(11%) Query 1 APAYAVKRADIATAIEDAVVNAANHRGQVGDGVCRAVARKWPQAFRNAATPVGTAKTVKC 60 AP+Y V+R DI+ E+AVVNAAN +G VGDGVCRAVARKWP +F+ AATPVGTAK V+ Sbjct 1 APSYRVRRTDISGHAEEAVVNAANAKGTVGDGVCRAVARKWPDSFKGAATPVGTAKLVQA 60 Query 61 DETYIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFSAGKDR 120 + +IHAVGPNF+ +EAEGDR+LAAAYRAVA IN +I SVAIPLLSTG+FS GKDR Sbjct 61 NGMNVIHAVGPNFSTVTEAEGDRELAAAYRAVAGIINASNIKSVAIPLLSTGVFSGGKDR 120 Query 121 VHQSLSHLLAAMDTTEARVTIYCRDKTWEQKIKTVLQNRSATELVSDELQFEVNLTRVHP 180 V QSL+HL AMDTT+A V IYCRDK WE+KI+ + R+A ELVS+++ E +L RVHP Sbjct 121 VMQSLNHLFTAMDTTDADVVIYCRDKAWEKKIQEAIDRRTAVELVSEDISLESDLIRVHP 180 Query 181 
DSSLVGRPGYSTTDGTLYSYMEGTKFHQAALDMAEITTLWPRVQDANEHICLYALGETMD 240 DS LVGR GYS TDG L+SY+EGT+FHQ A+DMAEI+TLWP++QDANE ICLYALGE+MD Sbjct 181 DSCLVGRKGYSITDGKLHSYLEGTRFHQTAVDMAEISTLWPKLQDANEQICLYALGESMD 240 Query 241 NIRARCPVEDSDSSTPPKTVPCLCRYAMTPERVTRLRMHHTKDFVVCSSFQLPKYRIPGV 300 +IR +CPVED+DSSTPPKTVPCLCRYAMT ERV RLRM++TK +VCSSF LPKYRI GV Sbjct 241 SIRTKCPVEDADSSTPPKTVPCLCRYAMTAERVARLRMNNTKAIIVCSSFPLPKYRIEGV 300 Query 301 QRVKCEKVMLFDAAPPASVSPVQYLTNQSET---TISLSSFSITSDSSSL---STFPDLE 354 Q+VKC++V++FD P+ VSP +Y+ + T T+SL S T + S +T+ +E Sbjct 301 QKVKCDRVLIFDQTVPSLVSPRKYIPAAASTHADTVSLDSTVSTGSAWSFPSEATYETME 360 Query 355 SAEELDHDSQSVRPALNEPDDHQPTPTAELATHPVPPPRPNRARRLAAARVQVQV----- 409 E+ H V P A++ H + AARV++ V Sbjct 361 VVAEVHHSEPPVPPPRRR--------RAQVTMHHQELLEVSDMHTPIAARVEIPVYDTAV 412 Query 410 ---EVHQPPSNQPTKPIPAPRTS-LRPVPAPR-------RYVPRPV-------------- 444 V P +++ KPIPAPR + + PVPAPR R P P Sbjct 413 VVERVAIPCTSEYAKPIPAPRAARVVPVPAPRIQRASTYRVSPTPTPRVLRASVCSVTTS 472 Query 445 --VELPWPLETIDV-------EFGAPTE-----EESDITFGDFSASEWETIS----NSSX 486 VE PW E ++V + P E E+ DI FGDF S+ + X Sbjct 473 AGVEFPWAPEDLEVLTEPVHCKMREPVELPWEPEDVDIQFGDFETSDKIQFGDIDFDQFX 532 Query 487 LGRAGA 492 LGRAGA Sbjct 533 LGRAGA 538 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Semliki Forest virus] Sequence ID: P08411.2 Length: 2432 Range 1: 1337 to 1818 Score:526 bits(1355), Expect:1e-167, Method:Compositional matrix adjust., Identities:290/518(56%), 
Positives:352/518(67%), Gaps:62/518(11%) Query 1 APAYAVKRADIATAIEDAVVNAANHRGQVGDGVCRAVARKWPQAFRNAATPVGTAKTVKC 60 AP+Y VKRADIAT E AVVNAAN RG VGDGVCRAVA+KWP AF+ AATPVGT KTV C Sbjct 1337 APSYRVKRADIATCTEAAVVNAANARGTVGDGVCRAVAKKWPSAFKGAATPVGTIKTVMC 1396 Query 61 DETYIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFSAGKDR 120 +IHAV PNF+ T+EAEGDR+LAA YRAVAAE+NRLS+SSVAIPLLSTG+FS G+DR Sbjct 1397 GSYPVIHAVAPNFSATTEAEGDRELAAVYRAVAAEVNRLSLSSVAIPLLSTGVFSGGRDR 1456 Query 121 VHQSLSHLLAAMDTTEARVTIYCRDKTWEQKIKTVLQNRSATELVSDELQFEVNLTRVHP 180 + QSL+HL AMD T+A VTIYCRDK+WE+KI+ + R+A EL++D+++ +L RVHP Sbjct 1457 LQQSLNHLFTAMDATDADVTIYCRDKSWEKKIQEAIDMRTAVELLNDDVELTTDLVRVHP 1516 Query 181 DSSLVGRPGYSTTDGTLYSYMEGTKFHQAALDMAEITTLWPRVQDANEHICLYALGETMD 240 DSSLVGR GYSTTDG+LYSY EGTKF+QAA+DMAEI TLWPR+Q+ANE ICLYALGETMD Sbjct 1517 DSSLVGRKGYSTTDGSLYSYFEGTKFNQAAIDMAEILTLWPRLQEANEQICLYALGETMD 1576 Query 241 NIRARCPVEDSDSSTPPKTVPCLCRYAMTPERVTRLRMHHTKDFVVCSSFQLPKYRIPGV 300 NIR++CPV DSDSSTPP+TVPCLCRYAMT ER+ RLR H K VVCSSF LPKY + GV Sbjct 1577 NIRSKCPVNDSDSSTPPRTVPCLCRYAMTAERIARLRSHQVKSMVVCSSFPLPKYHVDGV 1636 Query 301 QRVKCEKVMLFDAAPPASVSPVQY---LTNQSETTISLSSFSITSDSSSLS----TFPDL 353 Q+VKCEK +LFD P+ VSP +Y T+ S+ ++ T+DSSS + + P L Sbjct 1637 QKVKCEKGLLFDPTVPSVVSPRKYAASTTDHSDRSLRGFDLDWTTDSSSTASDTMSLPSL 1696 Query 354 ESAEELDHDSQSVRPALNEPDDH-QPTPTAELAT-------------HPVPPPRPNRARR 399 +S ++D + + P + D H +P A+LA +P+PPPRP RA Sbjct 1697 QSC-DIDSIYEPMAPIVVTADVHPEPAGIADLAADVHPEPADHVDLENPIPPPRPKRAAY 1755 Query 400 LAAARVQVQVEVHQPPSNQPTKPIPAPRTSLRPVPAPRRYVPRPVVELPWPLETIDVEFG 459 LA S +P+PAPR +P PAPR +LP + FG Sbjct 1756 LA--------------SRAAERPVPAPR---KPTPAPRTAFRN---KLP-------LTFG 1788 Query 460 APTEEESD-----ITFGDFSASEWETISNSSXLGRAGA 492 E E D ITFGDF + LGRAGA Sbjct 1789 DFDEHEVDALASGITFGDF--------DDVLRLGRAGA 1818 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; 
Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Getah virus] Sequence ID: Q5Y389.3 Length: 2467 Range 1: 1333 to 1856 Score:512 bits(1318), Expect:2e-162, Method:Compositional matrix adjust., Identities:286/546(52%), Positives:354/546(64%), Gaps:76/546(13%) Query 1 APAYAVKRADIATAIEDAVVNAANHRGQVGDGVCRAVARKWPQAFRNAATPVGTAKTVKC 60 AP+Y V+RADI+ E+AVVNAAN +G V DGVCRAVA+KWP +F+ AATPVGTAK ++ Sbjct 1333 APSYRVRRADISGHSEEAVVNAANAKGTVSDGVCRAVAKKWPSSFKGAATPVGTAKMIRA 1392 Query 61 DETYIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFSAGKDR 120 D +IHAVGPNF+ +EAEGDR+LAAAYRAVA+ I+ +I SVA+PLLSTG FS GKDR Sbjct 1393 DGMTVIHAVGPNFSTVTEAEGDRELAAAYRAVASIISTNNIKSVAVPLLSTGTFSGGKDR 1452 Query 121 VHQSLSHLLAAMDTTEARVTIYCRDKTWEQKIKTVLQNRSATELVSDELQFEVNLTRVHP 180 V QSL+HL A+D T+A V IYCRDK WE+KI+ + R+A ELVS+++ E +L RVHP Sbjct 1453 VMQSLNHLFTALDATDADVVIYCRDKNWEKKIQEAIDRRTAIELVSEDVTLETDLVRVHP 1512 Query 181 DSSLVGRPGYSTTDGTLYSYMEGTKFHQAALDMAEITTLWPRVQDANEHICLYALGETMD 240 DS LVGR GYS TDG LYSY+EGT+FHQ A+DMAEI+TLWPR+QDANE ICLYALGETMD Sbjct 1513 DSCLVGRNGYSATDGKLYSYLEGTRFHQTAVDMAEISTLWPRLQDANEQICLYALGETMD 1572 Query 241 NIRARCPVEDSDSSTPPKTVPCLCRYAMTPERVTRLRMHHTKDFVVCSSFQLPKYRIPGV 300 +IR +CPVED+DSSTPPKTVPCLCRYAMT ERV RLRM++TK+ +VCSSF LPKYRI GV Sbjct 1573 SIRTKCPVEDADSSTPPKTVPCLCRYAMTAERVARLRMNNTKNIIVCSSFPLPKYRIEGV 1632 Query 301 QRVKCEKVMLFDAAPPASVSPVQYLTNQSETTISLSSFSITSDSSSLSTFP--------- 351 Q+VKC++V++FD P+ VSP +Y+ E ++S S TS S+ S FP Sbjct 1633 QKVKCDRVLIFDQTVPSLVSPRKYIQQPPEQLDNVSLTSTTSTGSAWS-FPSETTYETME 1691 Query 352 --------------------------DLESAEELD-HDSQSVRPALNE---PDDHQ--PT 379 DLE EE++ + +Q + E D + P Sbjct 1692 
VVAEVHTEPPIPPPRRRRAAVAQLRQDLEVTEEIEPYVTQQAEIMVMERVATTDIRAIPV 1751 Query 380 PTAELATHPVPPPRPNRARRLAAARVQVQVEVHQPPSNQPTKPIPAPR----TSLRPVPA 435 P T PVP P R R++A +PP +P PIPAPR TS P Sbjct 1752 PARRAITMPVPAP---RVRKVAT----------EPPL-EPEAPIPAPRKRRTTSTSPPHN 1797 Query 436 PRRYVPRPVVELPWPLETIDVEFG---------APTEEESDITFGDFSASEWETISNSSX 486 P +VPR VELPW E +D++FG + + I FGD N SX Sbjct 1798 PEDFVPRVPVELPWEPEDLDIQFGDLEPRRRNTRDRDVSTGIQFGDIDF-------NQSX 1850 Query 487 LGRAGA 492 LGRAGA Sbjct 1851 LGRAGA 1856 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Sagiyama virus] Sequence ID: Q9JGL0.3 Length: 2467 Range 1: 1333 to 1856 Score:511 bits(1316), Expect:4e-162, Method:Compositional matrix adjust., Identities:281/545(52%), Positives:353/545(64%), Gaps:74/545(13%) Query 1 APAYAVKRADIATAIEDAVVNAANHRGQVGDGVCRAVARKWPQAFRNAATPVGTAKTVKC 60 AP+Y V+RADI+ E+AVVNAAN +G V DGVCRAVA+KWP +F+ AATPVGTAK ++ Sbjct 1333 APSYRVRRADISGHGEEAVVNAANAKGTVSDGVCRAVAKKWPSSFKGAATPVGTAKMIRA 1392 Query 61 DETYIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFSAGKDR 120 D +IHAVGPNF+ +EAEGDR+LAAAYRAVA+ I+ +I SVA+PLLSTG FS GKDR Sbjct 1393 DGMTVIHAVGPNFSTVTEAEGDRELAAAYRAVASIISTNNIKSVAVPLLSTGTFSGGKDR 1452 Query 121 VHQSLSHLLAAMDTTEARVTIYCRDKTWEQKIKTVLQNRSATELVSDELQFEVNLTRVHP 180 V QSL+HL A+D T+A V IYCRDK WE+KI+ + R+A ELVS+++ E +L RVHP Sbjct 1453 VMQSLNHLFTALDATDADVVIYCRDKNWEKKIQEAIDRRTAIELVSEDVTLETDLVRVHP 1512 Query 181 
DSSLVGRPGYSTTDGTLYSYMEGTKFHQAALDMAEITTLWPRVQDANEHICLYALGETMD 240 DS LVGR GYS TDG LYSY+EGT+FHQ A+DMAEI+TLWPR+QDANE ICLYALGETMD Sbjct 1513 DSCLVGRNGYSATDGKLYSYLEGTRFHQTAVDMAEISTLWPRLQDANEQICLYALGETMD 1572 Query 241 NIRARCPVEDSDSSTPPKTVPCLCRYAMTPERVTRLRMHHTKDFVVCSSFQLPKYRIPGV 300 +IR +CPVED+DSSTPPKTVPCLCRYAMT ERV RLRM++TK+ +VCSSF LPKYRI GV Sbjct 1573 SIRTKCPVEDADSSTPPKTVPCLCRYAMTAERVARLRMNNTKNIIVCSSFPLPKYRIEGV 1632 Query 301 QRVKCEKVMLFDAAPPASVSPVQYLTNQSETTISLSSFSITSDSSSLS-----TFPDLES 355 Q+VKC++V++FD P+ VSP +Y+ E ++S S TS S+ S T+ +E Sbjct 1633 QKVKCDRVLIFDQTVPSLVSPRKYIQQPPEQLDNVSLTSTTSTGSAWSLPSETTYETMEV 1692 Query 356 AEELDHD----------------------SQSVRPALNEPDDHQ-------------PTP 380 E+ + ++ + P + + + P P Sbjct 1693 VAEVHTEPPIPPPRRRRAAVAQLRQDLEVTEEIEPYVIQQAEIMVMERVATTDIRAIPVP 1752 Query 381 TAELATHPVPPPRPNRARRLAAARVQVQVEVHQPPSNQPTKPIPAPR----TSLRPVPAP 436 T PVP P R R++A +PPS +P PIPAPR TS P P Sbjct 1753 ARRAITMPVPAP---RVRKVAT----------EPPS-EPEAPIPAPRKRRTTSTTPPHNP 1798 Query 437 RRYVPRPVVELPWPLETIDVEFG---------APTEEESDITFGDFSASEWETISNSSXL 487 +VPR VELPW E +D++FG + + I FGD N SXL Sbjct 1799 GDFVPRVPVELPWEPEDLDIQFGDLEPRRRNTRDWDVSTGIQFGDIDF-------NQSXL 1851 Query 488 GRAGA 492 GRAGA Sbjct 1852 GRAGA 1856 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Ross river virus (STRAIN NB5092)] Sequence ID: P13887.2 Length: 2480 Range 1: 1332 to 1846 Score:500 bits(1288), Expect:3e-158, Method:Compositional matrix adjust., 
Identities:273/523(52%), Positives:339/523(64%), Gaps:58/523(11%) Query 1 APAYAVKRADIATAIEDAVVNAANHRGQVGDGVCRAVARKWPQAFRNAATPVGTAKTVKC 60 AP+Y V+R DI+ E+AVVNAAN +G VG GVCRAVARKWP +F+ AATPVGTAK V+ Sbjct 1332 APSYRVRRTDISGHAEEAVVNAANAKGTVGVGVCRAVARKWPDSFKGAATPVGTAKLVQA 1391 Query 61 DETYIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFSAGKDR 120 + +IHAVGPNF+ +EAEGDR+LAAAYRAVA IN +I SVAIPLLSTG+FS GKDR Sbjct 1392 NGMNVIHAVGPNFSTVTEAEGDRELAAAYRAVAGIINASNIKSVAIPLLSTGVFSGGKDR 1451 Query 121 VHQSLSHLLAAMDTTEARVTIYCRDKTWEQKIKTVLQNRSATELVSDELQFEVNLTRVHP 180 V QSL+HL AMDTT+A V IYCRDK WE+KI+ + R+A ELVS+++ E +L RVHP Sbjct 1452 VMQSLNHLFTAMDTTDADVVIYCRDKAWEKKIQEAIDRRTAVELVSEDISLESDLIRVHP 1511 Query 181 DSSLVGRPGYSTTDGTLYSYMEGTKFHQAALDMAEITTLWPRVQDANEHICLYALGETMD 240 DS LVGR GYS TDG L+SY+EGT+FHQ A+DMAEI+TLWP++QDANE ICLYALGE+MD Sbjct 1512 DSCLVGRKGYSITDGKLHSYLEGTRFHQTAVDMAEISTLWPKLQDANEQICLYALGESMD 1571 Query 241 NIRARCPVEDSDSSTPPKTVPCLCRYAMTPERVTRLRMHHTKDFVVCSSFQLPKYRIPGV 300 +IR +CPVED+DSSTPPKTVPCLCRYAMT ERV RLRM++TK +VCSSF LPKYRI GV Sbjct 1572 SIRTKCPVEDADSSTPPKTVPCLCRYAMTAERVARLRMNNTKAIIVCSSFPLPKYRIEGV 1631 Query 301 QRVKCEKVMLFDAAPPASVSPVQYL---TNQSETTISLSSFSITSDSSSL---STFPDLE 354 Q+VKC++V++FD P+ VSP +Y+ + T+SL S T + S +T+ +E Sbjct 1632 QKVKCDRVLIFDQTVPSLVSPRKYIPAAASMHADTVSLDSTVSTGSAWSFPSEATYETME 1691 Query 355 SAEELDHDSQSVRPALNEPDDHQPTPTAELATHPVPPPRPNRARRLAAARVQVQV----- 409 E+ H V P A++ H + AARV++ V Sbjct 1692 VVAEVHHSEPPVPPP--------RRRRAQVTMHHQELLEVSDMHTPIAARVEIPVYDTAV 1743 Query 410 ---EVHQPPSNQPTKPIPAPR-TSLRPVPAPR-------RYVPRPV-------------- 444 V P +++ PIP PR + PVPAPR R P P Sbjct 1744 VAERVAIPCTSEYATPIPTPRAVRVVPVPAPRIQRASTYRVSPTPTPRVLRASVCSVTTS 1803 Query 445 --VELPWPLETIDV-------EFGAPTE-----EESDITFGDF 473 VE PW E ++V E P E E+ DI FGDF Sbjct 1804 AGVEFPWAPEDLEVLTEPVHCEMREPVELPWEPEDVDIQFGDF 1846 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping 
enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Chikungunya virus strain S27-African prototype] Sequence ID: Q8JUX6.1 Length: 2474 Range 1: 1334 to 1863 Score:498 bits(1283), Expect:1e-157, Method:Compositional matrix adjust., Identities:265/536(49%), Positives:345/536(64%), Gaps:50/536(9%) Query 1 APAYAVKRADIATAIEDAVVNAANHRGQVGDGVCRAVARKWPQAFRNAATPVGTAKTVKC 60 AP+Y VKR DIA E+ VVNAAN RG GDGVC+AV +KWP++F+N+ATPVGTAKTV C Sbjct 1334 APSYRVKRMDIAKNDEECVVNAANPRGLPGDGVCKAVYKKWPESFKNSATPVGTAKTVMC 1393 Query 61 DETYIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFSAGKDR 120 +IHAVGPNF+N SE+EGDR+LAAAYR VA E+ RL ++SVAIPLLSTG++S GKDR Sbjct 1394 GTYPVIHAVGPNFSNYSESEGDRELAAAYREVAKEVTRLGVNSVAIPLLSTGVYSGGKDR 1453 Query 121 VHQSLSHLLAAMDTTEARVTIYCRDKTWEQKIKTVLQNRSATELVSDELQFEVNLTRVHP 180 + QSL+HL AMD+T+A V IYCRDK WE+KI +Q R+ EL+ + + + ++ RVHP Sbjct 1454 LTQSLNHLFTAMDSTDADVVIYCRDKEWEKKISEAIQMRTQVELLDEHISIDCDVVRVHP 1513 Query 181 DSSLVGRPGYSTTDGTLYSYMEGTKFHQAALDMAEITTLWPRVQDANEHICLYALGETMD 240 DSSL GR GYSTT+G LYSY+EGT+FHQ A+DMAEI T+WP+ +ANE +CLYALGE+++ Sbjct 1514 DSSLAGRKGYSTTEGALYSYLEGTRFHQTAVDMAEIYTMWPKQTEANEQVCLYALGESIE 1573 Query 241 NIRARCPVEDSDSSTPPKTVPCLCRYAMTPERVTRLRMHHTKDFVVCSSFQLPKYRIPGV 300 +IR +CPV+D+D+S+PPKTVPCLCRYAMTPERVTRLRM+H +VCSSF LPKY+I GV Sbjct 1574 SIRQKCPVDDADASSPPKTVPCLCRYAMTPERVTRLRMNHVTSIIVCSSFPLPKYKIEGV 1633 Query 301 QRVKCEKVMLFDAAPPASVSPVQYLTNQ-------SETTISLSSFSITSDSSSLSTFPDL 353 Q+VKC KVMLFD P+ VSP +Y +Q + T+++ S F ++ D L D Sbjct 1634 QKVKCSKVMLFDHNVPSRVSPREYRPSQESVQEASTTTSLTHSQFDLSVDGKILPVPSD- 1692 Query 354 ESAEELDHDSQSVRPALNEPDDHQ-PTPTAELA--------THPVPPPRPNRARRLAAAR 404 LD D+ ++ PAL++ H P+ T LA T PV PPR R R L Sbjct 1693 -----LDADAPALEPALDDGAIHTLPSATGNLAAVSDWVMSTVPVAPPRRRRGRNLTVTC 1747 
Query 405 VQVQVEV-----------HQPPSNQPTKPIPAPRTSLRPVPAPRRYVPRPVVELPWPLET 453 + + + P Q T SL+ P+ + P + P ET Sbjct 1748 DEREGNITPMASVRFFRAELCPVVQETAETRDTAMSLQAPPSTATELSHPPISFGAPSET 1807 Query 454 IDVEFGAPTEEESD------ITFGDF--------SASEWETISNSS---XLGRAGA 492 + FG E E + +TFGDF + S+W T S++ L RAG Sbjct 1808 FPITFGDFNEGEIESLSSELLTFGDFLPGEVDDLTDSDWSTCSDTDDELRLDRAGG 1863 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Chikungunya virus strain Senegal 37997] Sequence ID: Q5XXP4.1 Length: 2474 Range 1: 1334 to 1825 Score:491 bits(1265), Expect:4e-155, Method:Compositional matrix adjust., Identities:253/500(51%), Positives:330/500(66%), Gaps:25/500(5%) Query 1 APAYAVKRADIATAIEDAVVNAANHRGQVGDGVCRAVARKWPQAFRNAATPVGTAKTVKC 60 AP+Y VKR DIA E+ VVNAAN RG GDGVC+AV +KWP++F+N+ATPVGTAKTV C Sbjct 1334 APSYRVKRMDIAKNDEECVVNAANPRGLPGDGVCKAVYKKWPESFKNSATPVGTAKTVMC 1393 Query 61 DETYIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFSAGKDR 120 +IHAVGPNF+N SE+EGDR+LAAAYR VA E+ RL ++SVAIPLLSTG++S GKDR Sbjct 1394 GTYPVIHAVGPNFSNYSESEGDRELAAAYREVAKEVTRLGVNSVAIPLLSTGVYSGGKDR 1453 Query 121 VHQSLSHLLAAMDTTEARVTIYCRDKTWEQKIKTVLQNRSATELVSDELQFEVNLTRVHP 180 + QSL+HL A+D+T+A V IYCRDK WE+KI +Q R+ EL+ + + + ++ RVHP Sbjct 1454 LTQSLNHLFTALDSTDADVVIYCRDKEWEKKIAEAIQMRTQVELLDEHISVDCDIIRVHP 1513 Query 181 DSSLVGRPGYSTTDGTLYSYMEGTKFHQAALDMAEITTLWPRVQDANEHICLYALGETMD 240 DSSL GR GYSTT+G+LYSY+EGT+FHQ A+DMAE+ T+WP+ +ANE +CLYALGE+++ Sbjct 1514 
DSSLAGRKGYSTTEGSLYSYLEGTRFHQTAVDMAEVYTMWPKQTEANEQVCLYALGESIE 1573 Query 241 NIRARCPVEDSDSSTPPKTVPCLCRYAMTPERVTRLRMHHTKDFVVCSSFQLPKYRIPGV 300 +IR +CPV+D+D+S+PPKTVPCLCRYAMTPERVTRLRM+H +VCSSF LPKY+I GV Sbjct 1574 SIRQKCPVDDADASSPPKTVPCLCRYAMTPERVTRLRMNHVTSIIVCSSFPLPKYKIEGV 1633 Query 301 QRVKCEKVMLFDAAPPASVSPVQYLTNQ-------SETTISLSSFSITSDSSSLSTFPDL 353 Q+VKC KVMLFD P+ VSP +Y + Q S T+++ S F ++ D L DL Sbjct 1634 QKVKCSKVMLFDHNVPSRVSPREYKSPQETAQEVSSTTSLTHSQFDLSVDGEELPAPSDL 1693 Query 354 ESAEELDHDSQSVRPALNEP---DDHQPTPTAELATHPVPPPRPNRARRLAAARVQVQVE 410 E+ + + R L P D+ + T PV PPR R + L V Sbjct 1694 EADAPIPEPTPDDRAVLTLPPTIDNFSAVSDWVMNTAPVAPPRRRRGKNL-------NVT 1746 Query 411 VHQPPSN-QPTKPIPAPRTSLRPVPAPRRYVPRPVVELPWPLET------IDVEFGAPTE 463 + N P + R L + + L PL + + FGAP Sbjct 1747 CDEREGNVLPMASVRFFRADLHSIVQETAEIRDTAASLQAPLSVATEPNQLPISFGAPN- 1805 Query 464 EESDITFGDFSASEWETISN 483 E ITFGDF E E++S+ Sbjct 1806 ETFPITFGDFDEGEIESLSS 1825 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Igbo Ora virus] Sequence ID: O90370.1 Length: 2513 Range 1: 1334 to 1657 Score:467 bits(1201), Expect:2e-146, Method:Compositional matrix adjust., Identities:211/324(65%), Positives:262/324(80%), Gaps:0/324(0%) Query 1 APAYAVKRADIATAIEDAVVNAANHRGQVGDGVCRAVARKWPQAFRNAATPVGTAKTVKC 60 AP+Y VKR DIA E+ VVNAAN RG GDGVC+AV RKWP++FRN+ATPVGTAKT+ C Sbjct 1334 APSYRVKRMDIAKNTEECVVNAANPRGVPGDGVCKAVYRKWPESFRNSATPVGTAKTIMC 1393 Query 61 DETYIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFSAGKDR 120 + +IHAVGPNF+N SEAEGDR+LA+AYR VA E++RL +SSVAIPLLSTG++S GKDR Sbjct 1394 
GQYPVIHAVGPNFSNYSEAEGDRELASAYREVAKEVSRLGVSSVAIPLLSTGVYSGGKDR 1453 Query 121 VHQSLSHLLAAMDTTEARVTIYCRDKTWEQKIKTVLQNRSATELVSDELQFEVNLTRVHP 180 + QSL+HL AAMD+T+A V IYCRDK WE+KI + RS EL+ D + + ++ RVHP Sbjct 1454 LLQSLNHLFAAMDSTDADVVIYCRDKEWEKKITEAISLRSQVELLDDHISVDCDIVRVHP 1513 Query 181 DSSLVGRPGYSTTDGTLYSYMEGTKFHQAALDMAEITTLWPRVQDANEHICLYALGETMD 240 DSSL GR GYST +G LYSY+EGT+FHQ A+DMAEI T+WP+ +ANE +CLYALGE+++ Sbjct 1514 DSSLAGRKGYSTVEGALYSYLEGTRFHQTAVDMAEIYTMWPKQTEANEQVCLYALGESIE 1573 Query 241 NIRARCPVEDSDSSTPPKTVPCLCRYAMTPERVTRLRMHHTKDFVVCSSFQLPKYRIPGV 300 ++R +CPV+D+D+S PPKTVPCLCRYAMTPERV RLRM+HT +VCSSF LPKY+I GV Sbjct 1574 SVRQKCPVDDADASFPPKTVPCLCRYAMTPERVARLRMNHTTSIIVCSSFPLPKYKIEGV 1633 Query 301 QRVKCEKVMLFDAAPPASVSPVQY 324 Q+VKC K +LFD P+ VSP Y Sbjct 1634 QKVKCSKALLFDHNVPSRVSPRTY 1657 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [O'nyong-nyong virus strain Gulu] Sequence ID: P13886.2 Length: 2514 Range 1: 1334 to 1657 Score:465 bits(1196), Expect:7e-146, Method:Compositional matrix adjust., Identities:210/324(65%), Positives:261/324(80%), Gaps:0/324(0%) Query 1 APAYAVKRADIATAIEDAVVNAANHRGQVGDGVCRAVARKWPQAFRNAATPVGTAKTVKC 60 AP+Y VKR DIA E+ VVNAAN RG GDGVC+AV RKWP++FRN+ATPVGTAKT+ C Sbjct 1334 APSYRVKRMDIAKNTEECVVNAANPRGVPGDGVCKAVYRKWPESFRNSATPVGTAKTIMC 1393 Query 61 DETYIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFSAGKDR 120 + +IHAVGPNF+N SEAEGDR+LA+ YR VA E++RL +SSVAIPLLSTG++S GKDR Sbjct 1394 GQYPVIHAVGPNFSNYSEAEGDRELASVYREVAKEVSRLGVSSVAIPLLSTGVYSGGKDR 1453 Query 121 
VHQSLSHLLAAMDTTEARVTIYCRDKTWEQKIKTVLQNRSATELVSDELQFEVNLTRVHP 180 + QSL+HL AAMD+T+A V IYCRDK WE+KI + RS EL+ D + + ++ RVHP Sbjct 1454 LLQSLNHLFAAMDSTDADVVIYCRDKEWEKKITEAISLRSQVELLDDHISVDCDIVRVHP 1513 Query 181 DSSLVGRPGYSTTDGTLYSYMEGTKFHQAALDMAEITTLWPRVQDANEHICLYALGETMD 240 DSSL GR GYST +G LYSY+EGT+FHQ A+DMAEI T+WP+ +ANE +CLYALGE+++ Sbjct 1514 DSSLAGRKGYSTVEGALYSYLEGTRFHQTAVDMAEIYTMWPKQTEANEQVCLYALGESIE 1573 Query 241 NIRARCPVEDSDSSTPPKTVPCLCRYAMTPERVTRLRMHHTKDFVVCSSFQLPKYRIPGV 300 ++R +CPV+D+D+S PPKTVPCLCRYAMTPERV RLRM+HT +VCSSF LPKY+I GV Sbjct 1574 SVRQKCPVDDADASFPPKTVPCLCRYAMTPERVARLRMNHTTSIIVCSSFPLPKYKIEGV 1633 Query 301 QRVKCEKVMLFDAAPPASVSPVQY 324 Q+VKC K +LFD P+ VSP Y Sbjct 1634 QKVKCSKALLFDHNVPSRVSPRTY 1657 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [O'nyong-nyong virus strain SG650] Sequence ID: O90368.1 Length: 2513 Range 1: 1334 to 1657 Score:463 bits(1192), Expect:2e-145, Method:Compositional matrix adjust., Identities:209/324(65%), Positives:260/324(80%), Gaps:0/324(0%) Query 1 APAYAVKRADIATAIEDAVVNAANHRGQVGDGVCRAVARKWPQAFRNAATPVGTAKTVKC 60 AP+Y VKR DIA E+ VVNAAN RG GDGVC+AV RKWP++FRN+ATPVGTAKT+ C Sbjct 1334 APSYRVKRMDIAKNTEECVVNAANPRGVPGDGVCKAVYRKWPESFRNSATPVGTAKTIMC 1393 Query 61 DETYIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFSAGKDR 120 + +IHAVGPNF+N SEAEGDR+LA+ YR VA E++RL +SSVAIPLLSTG++S GKDR Sbjct 1394 GQYPVIHAVGPNFSNYSEAEGDRELASVYREVAKEVSRLGVSSVAIPLLSTGVYSGGKDR 1453 Query 121 
VHQSLSHLLAAMDTTEARVTIYCRDKTWEQKIKTVLQNRSATELVSDELQFEVNLTRVHP 180 + QSL+HL AMD+T+A V IYCRDK WE+KI + RS EL+ D + + ++ RVHP Sbjct 1454 LLQSLNHLFTAMDSTDADVVIYCRDKEWEKKITEAISLRSQVELLDDHISVDCDIVRVHP 1513 Query 181 DSSLVGRPGYSTTDGTLYSYMEGTKFHQAALDMAEITTLWPRVQDANEHICLYALGETMD 240 DSSL GR GYST +G LYSY+EGT+FHQ A+DMAEI T+WP+ +ANE +CLYALGE+++ Sbjct 1514 DSSLAGRKGYSTVEGALYSYLEGTRFHQTAVDMAEIYTMWPKQTEANEQVCLYALGESIE 1573 Query 241 NIRARCPVEDSDSSTPPKTVPCLCRYAMTPERVTRLRMHHTKDFVVCSSFQLPKYRIPGV 300 ++R +CPV+D+D+S PPKTVPCLCRYAMTPERV RLRM+HT +VCSSF LPKY+I GV Sbjct 1574 SVRQKCPVDDADASFPPKTVPCLCRYAMTPERVARLRMNHTTSIIVCSSFPLPKYKIEGV 1633 Query 301 QRVKCEKVMLFDAAPPASVSPVQY 324 Q+VKC K +LFD P+ VSP Y Sbjct 1634 QKVKCSKALLFDHNVPSRVSPRTY 1657 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Barmah Forest virus] Sequence ID: P87515.3 Length: 2411 Range 1: 1332 to 1801 Score:421 bits(1082), Expect:2e-130, Method:Compositional matrix adjust., Identities:242/501(48%), Positives:302/501(60%), Gaps:40/501(7%) Query 1 APAYAVKRADIATAIEDAVVNAANHRGQVGDGVCRAVARKWPQAFRNAATPVGTAKTVKC 60 APAY VKR DI+ A EDAVVNAAN +G G GVC A+ RKWP AF + ATP GTA + Sbjct 1332 APAYRVKRGDISNAPEDAVVNAANQQGVKGAGVCGAIYRKWPDAFGDVATPTGTAVSKSV 1391 Query 61 DETYIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFSAGKDR 120 + +IHAVGPNF+ SE EGDRDLA+AYRA A + I++VA+PLLSTGI++ GK+R Sbjct 1392 QDKLVIHAVGPNFSKCSEEEGDRDLASAYRAAAEIVMDKKITTVAVPLLSTGIYAGGKNR 1451 Query 121 
VHQSLSHLLAAMDTTEARVTIYCRDKTWEQKIKTVLQNRSATELVSDELQFEVNLTRVHP 180 V QSL+HL A D T+A VTIYC DKTWE+KIK + +R++ E+V D++Q E L RVHP Sbjct 1452 VEQSLNHLFTAFDNTDADVTIYCMDKTWEKKIKEAIDHRTSVEMVQDDVQLEEELVRVHP 1511 Query 181 DSSLVGRPGYSTTDGTLYSYMEGTKFHQAALDMAEITTLWPRVQDANEHICLYALGETMD 240 SSL GR GYST G ++SY+EGTKFHQ A+D+AE+ LWP ++++NE I Y LGE+MD Sbjct 1512 LSSLAGRKGYSTDSGRVFSYLEGTKFHQTAVDIAEMQVLWPALKESNEQIVAYTLGESMD 1571 Query 241 NIRARCPVEDSDSSTPPKTVPCLCRYAMTPERVTRLRMHHTKDFVVCSSFQLPKYRIPGV 300 IR +CP ED+D+STPP+TVPCLCRYAMTPERV RL+ +T F VCSSF+LPKY I GV Sbjct 1572 QIRGKCPTEDTDASTPPRTVPCLCRYAMTPERVYRLKCTNTTQFTVCSSFELPKYHIQGV 1631 Query 301 QRVKCEKVMLFD-AAPPASVSPVQYLTNQSETTISLSSFSITSDSSSLSTFPDLESAEEL 359 QRVKCE++++ D PP P + +TIS +S + DS SLSTF + Sbjct 1632 QRVKCERIIILDPTVPPTYKRPC---IRRYPSTISCNS---SEDSRSLSTFSVSSDS--- 1682 Query 360 DHDSQSVRPALNEPDDHQPTPTAELATHPVPPPRP----NRARRLAAARVQVQVEVHQPP 415 S P D +P P PVP PR V+ EVHQ P Sbjct 1683 ---SIGSLPV----GDTRPIPAPRTIFRPVPAPRAPVLRTTPPPKPPRTFTVRAEVHQAP 1735 Query 416 SNQPTKPIPAPRTSLRPVPAPRRYVPRPVVELPWPLETIDVEFGAPTEEE---SDITFGD 472 P P R R P +FG EE S +TFGD Sbjct 1736 PTPVPPPRPK-----RAAKLAREMHPGFTFG----------DFGEHEVEELTASPLTFGD 1780 Query 473 FSASEWETIS-NSSXLGRAGA 492 F+ E + + XLGRAG Sbjct 1781 FAEGEIQGMGVEFEXLGRAGG 1801 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Eastern equine encephalitis virus (strain PE-3.0815)] Sequence ID: Q306W8.3 Length: 2474 Range 1: 1328 to 1651 Score:387 
bits(994), Expect:8e-119, Method:Compositional matrix adjust., Identities:183/325(56%), Positives:235/325(72%), Gaps:2/325(0%) Query 1 APAYAVKRADIATAIEDAVVNAANHRGQVGDGVCRAVARKWPQAFRNAATPVGTAKTVKC 60 APAY V R DI+ + ++A+VNAAN++GQ G GVC A+ +KWP AF GTA VK Sbjct 1328 APAYRVIRGDISKSTDEAIVNAANNKGQPGAGVCGALYKKWPGAFDKVPIATGTAHLVKH 1387 Query 61 DETYIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFSAGKDR 120 IIHAVGPNF+ SE EG++ L+ Y +A INR + V+IPLLSTGI++ GKDR Sbjct 1388 TPN-IIHAVGPNFSRVSEVEGNQKLSEVYMDIAKIINRERYNKVSIPLLSTGIYAGGKDR 1446 Query 121 VHQSLSHLLAAMDTTEARVTIYCRDKTWEQKIK-TVLQNRSATELVSDELQFEVNLTRVH 179 V QSL+HL AMDTT+A VTIYC DK WE +IK + + S ELV D+ ++ L RVH Sbjct 1447 VMQSLNHLFTAMDTTDADVTIYCLDKQWEARIKDAIARKESVEELVEDDKPVDIELVRVH 1506 Query 180 PDSSLVGRPGYSTTDGTLYSYMEGTKFHQAALDMAEITTLWPRVQDANEHICLYALGETM 239 P SSLVGRPGYST +G ++SY+EGT+FHQ A D+AEI +WP Q+ANE ICLY LGE+M Sbjct 1507 PLSSLVGRPGYSTDEGKVHSYLEGTRFHQTAKDIAEIYAMWPNKQEANEQICLYVLGESM 1566 Query 240 DNIRARCPVEDSDSSTPPKTVPCLCRYAMTPERVTRLRMHHTKDFVVCSSFQLPKYRIPG 299 +IR++CPVEDS++S+PP T+PCLC YAMT ERV RLRM + F VCSSFQLPKYRI G Sbjct 1567 TSIRSKCPVEDSEASSPPHTIPCLCNYAMTAERVYRLRMAKNEQFAVCSSFQLPKYRITG 1626 Query 300 VQRVKCEKVMLFDAAPPASVSPVQY 324 VQ+++C K ++F P ++ P ++ Sbjct 1627 VQKIQCNKPVIFSGVVPPAIHPRKF 1651 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Eastern equine encephalitis virus (strain PE-0.0155)] Sequence ID: Q306W6.3 Length: 2471 Range 1: 1328 to 1653 
Score:385 bits(990), Expect:3e-118, Method:Compositional matrix adjust., Identities:181/327(55%), Positives:236/327(72%), Gaps:2/327(0%) Query 1 APAYAVKRADIATAIEDAVVNAANHRGQVGDGVCRAVARKWPQAFRNAATPVGTAKTVKC 60 APAY V R DI+ + ++ +VNAAN++GQ G GVC A+ +KWP AF A GTA VK Sbjct 1328 APAYRVIRGDISKSTDEVIVNAANNKGQPGAGVCGALYKKWPGAFDKAPIATGTAHLVKH 1387 Query 61 DETYIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFSAGKDR 120 IIHAVGPNF+ SE EG++ L+ Y +A IN+ + V+IPLLSTG+++ GKDR Sbjct 1388 TPN-IIHAVGPNFSRMSEVEGNQKLSEVYMDIAKIINKERYNKVSIPLLSTGVYAGGKDR 1446 Query 121 VHQSLSHLLAAMDTTEARVTIYCRDKTWEQKIK-TVLQNRSATELVSDELQFEVNLTRVH 179 V QSL+HL AMDTT+A VTIYC DK WE +IK + + S ELV D+ ++ L RVH Sbjct 1447 VMQSLNHLFTAMDTTDADVTIYCLDKQWETRIKDAIARKESVEELVEDDKPVDIELVRVH 1506 Query 180 PDSSLVGRPGYSTTDGTLYSYMEGTKFHQAALDMAEITTLWPRVQDANEHICLYALGETM 239 P SSLVGRPGYST +G ++SY+EGT+FHQ A D+AEI +WP Q+ANE ICLY LGE+M Sbjct 1507 PQSSLVGRPGYSTNEGKVHSYLEGTRFHQTAKDIAEIYAMWPNKQEANEQICLYVLGESM 1566 Query 240 DNIRARCPVEDSDSSTPPKTVPCLCRYAMTPERVTRLRMHHTKDFVVCSSFQLPKYRIPG 299 +IR++CPVE+S++S+PP T+PCLC YAMT ERV RLRM + F VCSSFQLPKYRI G Sbjct 1567 TSIRSKCPVEESEASSPPHTIPCLCNYAMTAERVYRLRMAKNEQFAVCSSFQLPKYRITG 1626 Query 300 VQRVKCEKVMLFDAAPPASVSPVQYLT 326 VQ+++C K ++F P ++ P ++ T Sbjct 1627 VQKIQCNKPVIFSGVVPPAIHPRKFST 1653 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Eastern equine encephalitis virus (strain Florida 91-469)] Sequence ID: Q4QXJ8.3 Length: 2494 Range 
1: 1328 to 1659 Score:383 bits(984), Expect:2e-117, Method:Compositional matrix adjust., Identities:181/333(54%), Positives:234/333(70%), Gaps:2/333(0%) Query 1 APAYAVKRADIATAIEDAVVNAANHRGQVGDGVCRAVARKWPQAFRNAATPVGTAKTVKC 60 APAY V R DI + ++ +VNAAN++GQ G GVC A+ RKWP AF G A VK Sbjct 1328 APAYRVVRGDITKSNDEVIVNAANNKGQPGSGVCGALYRKWPGAFDKQPVATGKAHLVKH 1387 Query 61 DETYIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFSAGKDR 120 +IHAVGPNF+ SE EGD+ L+ Y +A IN + V+IPLLSTGI++ GKDR Sbjct 1388 SPN-VIHAVGPNFSRLSENEGDQKLSEVYMDIARIINNERFTKVSIPLLSTGIYAGGKDR 1446 Query 121 VHQSLSHLLAAMDTTEARVTIYCRDKTWEQKIK-TVLQNRSATELVSDELQFEVNLTRVH 179 V QSL+HL AMDTT+A +TIYC DK WE +IK + + S EL D+ ++ L RVH Sbjct 1447 VMQSLNHLFTAMDTTDADITIYCLDKQWESRIKEAITRKESVEELTEDDRPVDIELVRVH 1506 Query 180 PDSSLVGRPGYSTTDGTLYSYMEGTKFHQAALDMAEITTLWPRVQDANEHICLYALGETM 239 P SSL GRPGYSTT+G +YSY+EGT+FHQ A D+AEI +WP Q+ANE ICLY LGE+M Sbjct 1507 PLSSLAGRPGYSTTEGKVYSYLEGTRFHQTAKDIAEIYAMWPNKQEANEQICLYVLGESM 1566 Query 240 DNIRARCPVEDSDSSTPPKTVPCLCRYAMTPERVTRLRMHHTKDFVVCSSFQLPKYRIPG 299 ++IR++CPVE+S++S+PP T+PCLC YAMT ERV RLRM + F VCSSFQLPKYRI G Sbjct 1567 NSIRSKCPVEESEASSPPHTIPCLCNYAMTAERVYRLRMAKNEQFAVCSSFQLPKYRITG 1626 Query 300 VQRVKCEKVMLFDAAPPASVSPVQYLTNQSETT 332 VQ+++C K ++F P ++ P ++ + E T Sbjct 1627 VQKIQCSKPVIFSGTVPPAIHPRKFASVTVEDT 1659 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Venezuelan equine encephalitis virus] Sequence ID: Q8V294.3 Length: 2497 
Range 1: 1330 to 1659 Score:381 bits(978), Expect:1e-116, Method:Compositional matrix adjust., Identities:184/330(56%), Positives:232/330(70%), Gaps:5/330(1%) Query 1 APAYAVKRADIATAIEDAVVNAANHRGQVGDGVCRAVARKWPQAFRNAATPVGTAKTVKC 60 AP+Y V R DIATA E +VNAAN +GQ G GVC A+ RK+P++F VG A+ VK Sbjct 1330 APSYHVVRGDIATATEGVIVNAANSKGQPGSGVCGALYRKYPESFDLQPIEVGKARLVKG 1389 Query 61 DETYIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFSAGKDR 120 + ++IHAVGPNFN SE EGD+ LA AY ++A IN + SVAIPLLSTGIF+ KDR Sbjct 1390 NSKHLIHAVGPNFNKVSEVEGDKQLAEAYESIARIINDNNYRSVAIPLLSTGIFAGNKDR 1449 Query 121 VHQSLSHLLAAMDTTEARVTIYCRDKTWEQKIKTVLQNRSATELV-----SDELQFEVNL 175 + QSL+HLL A+DTT+A V IYCRDK WE +K V+ R A E + S + + L Sbjct 1450 LMQSLNHLLTALDTTDADVAIYCRDKKWEVTLKEVVARREAVEEICISEDSSVAEPDAEL 1509 Query 176 TRVHPDSSLVGRPGYSTTDGTLYSYMEGTKFHQAALDMAEITTLWPRVQDANEHICLYAL 235 RVHP SSL GR GYST+DG +SY+EGTKFHQAA DMAEI +WP +ANE +CLY L Sbjct 1510 VRVHPKSSLAGRKGYSTSDGKTFSYLEGTKFHQAAKDMAEINAMWPAATEANEQVCLYIL 1569 Query 236 GETMDNIRARCPVEDSDSSTPPKTVPCLCRYAMTPERVTRLRMHHTKDFVVCSSFQLPKY 295 GE+M +IR++CPVE+S++STPP T+PCLC +AMTPERV RL+ + VCSSF LPKY Sbjct 1570 GESMSSIRSKCPVEESEASTPPSTLPCLCIHAMTPERVQRLKASRPEQITVCSSFPLPKY 1629 Query 296 RIPGVQRVKCEKVMLFDAAPPASVSPVQYL 325 RI GVQ+++C +LF P + P +YL Sbjct 1630 RITGVQKIQCSHPILFSPKVPEYIHPRKYL 1659 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Venezuelan equine encephalitis virus (strain Mena II)] Sequence ID: Q9WJC7.3 
Length: 2499 Range 1: 1330 to 1661 Score:380 bits(976), Expect:2e-116, Method:Compositional matrix adjust., Identities:184/332(55%), Positives:232/332(69%), Gaps:5/332(1%) Query 1 APAYAVKRADIATAIEDAVVNAANHRGQVGDGVCRAVARKWPQAFRNAATPVGTAKTVKC 60 AP+Y V R DIATA E +VNAAN +GQ G GVC A+ RK+P++F VG A+ VK Sbjct 1330 APSYHVVRGDIATATEGVIVNAANSKGQPGSGVCGALYRKYPESFDLQPIEVGKARLVKG 1389 Query 61 DETYIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFSAGKDR 120 +IIHAVGPNF+ SE EGD+ LA AY ++A IN + SVAIPLLSTGIF+ KDR Sbjct 1390 SSKHIIHAVGPNFSKVSEVEGDKQLAEAYESIAKIINDNNYRSVAIPLLSTGIFAGNKDR 1449 Query 121 VHQSLSHLLAAMDTTEARVTIYCRDKTWEQKIKTVLQNRSATELV-----SDELQFEVNL 175 + QSL+HLL A+DTT+A V IYCRDK WE +K V+ R A E + S + + L Sbjct 1450 LMQSLNHLLTALDTTDADVAIYCRDKKWEVTLKEVVARREAVEEICISEDSSVAEPDAEL 1509 Query 176 TRVHPDSSLVGRPGYSTTDGTLYSYMEGTKFHQAALDMAEITTLWPRVQDANEHICLYAL 235 RVHP SSL GR GYST+DG +SY+EGTKFHQAA DMAEI +WP +ANE +CLY L Sbjct 1510 VRVHPKSSLAGRKGYSTSDGKTFSYLEGTKFHQAAKDMAEINAMWPTATEANEQVCLYIL 1569 Query 236 GETMDNIRARCPVEDSDSSTPPKTVPCLCRYAMTPERVTRLRMHHTKDFVVCSSFQLPKY 295 GE+M +IR++CPVE+S++STPP T+PCLC +AMTPERV RL+ + VCSSF LPKY Sbjct 1570 GESMSSIRSKCPVEESEASTPPSTLPCLCIHAMTPERVQRLKASRPEQITVCSSFPLPKY 1629 Query 296 RIPGVQRVKCEKVMLFDAAPPASVSPVQYLTN 327 RI GVQ+++C +LF P + P +YL + Sbjct 1630 RITGVQKIQCSHPILFSPKVPEYIHPRKYLAD 1661 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Venezuelan equine encephalitis virus (strain P676)] Sequence 
ID: P36328.2 Length: 2493 Range 1: 1330 to 1659 Score:379 bits(974), Expect:4e-116, Method:Compositional matrix adjust., Identities:182/330(55%), Positives:233/330(70%), Gaps:5/330(1%) Query 1 APAYAVKRADIATAIEDAVVNAANHRGQVGDGVCRAVARKWPQAFRNAATPVGTAKTVKC 60 AP+Y V R DIATA E ++NAAN +GQ G GVC A+ +K+P++F VG A+ VK Sbjct 1330 APSYHVVRGDIATATEGVIINAANSKGQPGGGVCGALYKKFPESFDLQPIEVGKARLVKG 1389 Query 61 DETYIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFSAGKDR 120 +IIHAVGPNFN SE EGD+ LA AY ++A +N + SVAIPLLSTGIFS KDR Sbjct 1390 AAKHIIHAVGPNFNKVSEVEGDKQLAEAYESIAKIVNDNNYKSVAIPLLSTGIFSGNKDR 1449 Query 121 VHQSLSHLLAAMDTTEARVTIYCRDKTWEQKIKTVLQNRSATE--LVSDELQF---EVNL 175 + QSL+HLL A+DTT+A V IYCRDK WE +K + R A E +SD+ + L Sbjct 1450 LTQSLNHLLTALDTTDADVAIYCRDKKWEMTLKEAVARREAVEEICISDDSSVTEPDAEL 1509 Query 176 TRVHPDSSLVGRPGYSTTDGTLYSYMEGTKFHQAALDMAEITTLWPRVQDANEHICLYAL 235 RVHP SSL GR GYST+DG +SY+EGTKFHQAA D+AEI +WP +ANE +C+Y L Sbjct 1510 VRVHPKSSLAGRKGYSTSDGKTFSYLEGTKFHQAAKDIAEINAMWPVATEANEQVCMYIL 1569 Query 236 GETMDNIRARCPVEDSDSSTPPKTVPCLCRYAMTPERVTRLRMHHTKDFVVCSSFQLPKY 295 GE+M +IR++CPVE+S++STPP T+PCLC +AMTPERV RL+ + VCSSF LPKY Sbjct 1570 GESMSSIRSKCPVEESEASTPPSTLPCLCIHAMTPERVQRLKASRPEQITVCSSFPLPKY 1629 Query 296 RIPGVQRVKCEKVMLFDAAPPASVSPVQYL 325 RI GVQ+++C + +LF PA + P +YL Sbjct 1630 RITGVQKIQCSQPILFSPKVPAYIHPRKYL 1659 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Venezuelan equine encephalitis virus (strain 3880)] 
Sequence ID: P36327.3 Length: 2485 Range 1: 1330 to 1800 Score:378 bits(970), Expect:1e-115, Method:Compositional matrix adjust., Identities:216/472(46%), Positives:281/472(59%), Gaps:32/472(6%) Query 1 APAYAVKRADIATAIEDAVVNAANHRGQVGDGVCRAVARKWPQAFRNAATPVGTAKTVKC 60 AP+Y V R DIATA E ++NAAN +GQ G GVC A+ +K+P++F VG A+ VK Sbjct 1330 APSYHVVRGDIATATEGVIINAANSKGQPGGGVCGALYKKFPESFDLQPIEVGKARLVKG 1389 Query 61 DETYIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFSAGKDR 120 +IIHAVGPNFN SE EGD+ LA AY ++A +N + SVAIPLLSTGIFS KDR Sbjct 1390 AAKHIIHAVGPNFNKVSEIEGDKQLAEAYESIAKIVNDNNYKSVAIPLLSTGIFSGNKDR 1449 Query 121 VHQSLSHLLAAMDTTEARVTIYCRDKTWEQKIKTVLQNRSATE--LVSDELQF---EVNL 175 + QSL+HLL A+DTT+A V IYCRDK WE +K + R A E +SD+ + L Sbjct 1450 LTQSLNHLLTALDTTDADVAIYCRDKKWEMTLKEAVARREAVEEICISDDSSVTEPDAEL 1509 Query 176 TRVHPDSSLVGRPGYSTTDGTLYSYMEGTKFHQAALDMAEITTLWPRVQDANEHICLYAL 235 RVHP SSL GR GYST+DG +SY+EGTKFHQAA D+AEI +WP +ANE +C+Y L Sbjct 1510 VRVHPKSSLAGRKGYSTSDGKTFSYLEGTKFHQAAKDIAEINAMWPVATEANEQVCMYIL 1569 Query 236 GETMDNIRARCPVEDSDSSTPPKTVPCLCRYAMTPERVTRLRMHHTKDFVVCSSFQLPKY 295 GE+M +IR++CPVE+S++STPP T+PCLC +AMTPERV RL+ + VCSSF LPKY Sbjct 1570 GESMSSIRSKCPVEESEASTPPSTLPCLCIHAMTPERVQRLKASRPEQITVCSSFPLPKY 1629 Query 296 RIPGVQRVKCEKVMLFDAAPPASVSPVQYLT-------NQSETTISLSSFSITSDSSSLS 348 RI GVQ+++C + +LF PA + P +YL NQS IT + Sbjct 1630 RITGVQKIQCSQPILFSPKVPAYIHPRKYLVETPTVEENQSTEGTPEQPTLITVGETRTR 1689 Query 349 TFPDLESAEELDH-----DSQSVRPALNEPDDHQPTPTAELATHPVPPPRPNRARRLA-- 401 T + EE D D + + E D H P P+A ++ +P L+ Sbjct 1690 TPEPIIIEEEEDSISLLSDGPTHQVLQVEADIHGP-PSASSSSWSIPHASDFDVDSLSIL 1748 Query 402 ---------AARVQVQVEVHQPPSNQ-PTKPIPAPRTSLR--PVPAPRRYVP 441 + V+ H S + +P+PAPRT R P PAPR P Sbjct 1749 DTLEGASVTSEEASVETNSHFARSMEFLARPVPAPRTVFRNPPQPAPRTRTP 1800 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping 
enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Venezuelan equine encephalitis virus (strain Trinidad donkey)] Sequence ID: P27282.3 Length: 2493 Range 1: 1330 to 1798 Score:370 bits(951), Expect:5e-113, Method:Compositional matrix adjust., Identities:210/470(45%), Positives:272/470(57%), Gaps:28/470(5%) Query 1 APAYAVKRADIATAIEDAVVNAANHRGQVGDGVCRAVARKWPQAFRNAATPVGTAKTVKC 60 AP+Y V R DIATA E ++NAAN +GQ G GVC A+ +K+P++F VG A+ VK Sbjct 1330 APSYHVVRGDIATATEGVIINAANSKGQPGGGVCGALYKKFPESFDLQPIEVGKARLVKG 1389 Query 61 DETYIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFSAGKDR 120 +IIHAVGPNFN SE EGD+ LA AY ++A +N + SVAIPLLSTGIFS KDR Sbjct 1390 AAKHIIHAVGPNFNKVSEVEGDKQLAEAYESIAKIVNDNNYKSVAIPLLSTGIFSGNKDR 1449 Query 121 VHQSLSHLLAAMDTTEARVTIYCRDKTWEQKIKTVLQNRSATE--LVSDELQF---EVNL 175 + QSL+HLL A+DTT+A V IYCRDK WE +K + R A E +SD+ + L Sbjct 1450 LTQSLNHLLTALDTTDADVAIYCRDKKWEMTLKEAVARREAVEEICISDDSSVTEPDAEL 1509 Query 176 TRVHPDSSLVGRPGYSTTDGTLYSYMEGTKFHQAALDMAEITTLWPRVQDANEHICLYAL 235 RVHP SSL GR GYST+DG +SY+EGTKFHQAA D+AEI +WP +ANE +C+Y L Sbjct 1510 VRVHPKSSLAGRKGYSTSDGKTFSYLEGTKFHQAAKDIAEINAMWPVATEANEQVCMYIL 1569 Query 236 GETMDNIRARCPVEDSDSSTPPKTVPCLCRYAMTPERVTRLRMHHTKDFVVCSSFQLPKY 295 GE+M +IR++CPVE+S++STPP T+PCLC +AMTPERV RL+ + VCSSF LPKY Sbjct 1570 GESMSSIRSKCPVEESEASTPPSTLPCLCIHAMTPERVQRLKASRPEQITVCSSFPLPKY 1629 Query 296 RIPGVQRVKCEKVMLFDAAPPASVSPVQYLT--------------NQSETTISLSSFSIT 341 RI GVQ+++C + +LF PA + P +YL NQS IT Sbjct 1630 RITGVQKIQCSQPILFSPKVPAYIHPRKYLVETPPVDETPEPSAENQSTEGTPEQPPLIT 1689 Query 342 SDSSSLSTFPDLESAE------ELDHDSQSVRPALNEPDDHQPTPTAELATHPVPPPRPN 395 D + T + E L D + + E D H P P+ ++ +P Sbjct 1690 
EDETRTRTPEPIIIEEEEEDSISLLSDGPTHQVLQVEADIHGP-PSVSSSSWSIPHASDF 1748 Query 396 RARRLAAARVQVQVEVHQPPSNQPTKPIPAPRTSL--RPVPAPRRYVPRP 443 L+ V ++ T A RPVPAPR P Sbjct 1749 DVDSLSILDTLEGASVTSGATSAETNSYFAKSMEFLARPVPAPRTVFRNP 1798 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; AltName: Full=p270 nonstructural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Sindbis virus] Sequence ID: P03317.2 Length: 2513 Range 1: 1348 to 1784 Score:366 bits(939), Expect:2e-111, Method:Compositional matrix adjust., Identities:201/437(46%), Positives:269/437(61%), Gaps:33/437(7%) Query 1 APAYAVKRADIATAIEDAVVNAANHRGQVGDGVCRAVARKWPQAFRNAATPVGTAKTVKC 60 AP+Y KR +IA E+AVVNAAN G+ G+GVCRA+ ++WP +F ++AT GTA+ C Sbjct 1348 APSYRTKRENIADCQEEAVVNAANPLGRPGEGVCRAIYKRWPTSFTDSATETGTARMTVC 1407 Query 61 DETYIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFSAGKDR 120 +IHAVGP+F EAE + L AY AVA +N +I SVAIPLLSTGI++AGKDR Sbjct 1408 LGKKVIHAVGPDFRKHPEAEALKLLQNAYHAVADLVNEHNIKSVAIPLLSTGIYAAGKDR 1467 Query 121 VHQSLSHLLAAMDTTEARVTIYCRDKTWEQKIKTVLQ-NRSATELVSDELQFEVNLTRVH 179 + SL+ L A+D T+A VTIYC DK W+++I LQ S TEL ++++ + L +H Sbjct 1468 LEVSLNCLTTALDRTDADVTIYCLDKKWKERIDAALQLKESVTELKDEDMEIDDELVWIH 1527 Query 180 PDSSLVGRPGYSTTDGTLYSYMEGTKFHQAALDMAEITTLWPRVQDANEHICLYALGETM 239 PDS L GR G+STT G LYSY EGTKFHQAA DMAEI L+P Q++NE +C Y LGETM Sbjct 1528 PDSCLKGRKGFSTTKGKLYSYFEGTKFHQAAKDMAEIKVLFPNDQESNEQLCAYILGETM 1587 Query 240 DNIRARCPVEDSDSSTPPKTVPCLCRYAMTPERVTRLRMHHTKDFVVCSSFQLPKYRIPG 299 + IR +CPV+ + 
SS+PPKT+PCLC YAMTPERV RLR ++ K+ VCSS LPK++I Sbjct 1588 EAIREKCPVDHNPSSSPPKTLPCLCMYAMTPERVHRLRSNNVKEVTVCSSTPLPKHKIKN 1647 Query 300 VQRVKCEKVMLFDAAPPASVSPVQYL-------------------------TNQSETTIS 334 VQ+V+C KV+LF+ PA V +Y+ + T++ Sbjct 1648 VQKVQCTKVVLFNPHTPAFVPARKYIEVPEQPTAPPAQAEEAPEVVATPSPSTADNTSLD 1707 Query 335 LSSFSITSDSSSLSTFPDLESAEE---LDHDSQSVRPALNEPDDHQPTPTAELAT----H 387 ++ S+ D SS + S + DS S P+ E D + A++ Sbjct 1708 VTDISLDMDDSSEGSLFSSFSGSDNSITSMDSWSSGPSSLEIVDRRQVVVADVHAVQEPA 1767 Query 388 PVPPPRPNRARRLAAAR 404 P+PPPR + RLAAAR Sbjct 1768 PIPPPRLKKMARLAAAR 1784 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Aura virus] Sequence ID: Q86924.3 Length: 2499 Range 1: 1346 to 1727 Score:365 bits(936), Expect:5e-111, Method:Compositional matrix adjust., Identities:198/387(51%), Positives:249/387(64%), Gaps:16/387(4%) Query 1 APAYAVKRADIATAIEDAVVNAANHRGQVGDGVCRAVARKWPQAFRNAATPVGTAKTVKC 60 AP+Y VKR +IA E+AVVNAAN RG+ GDGVCRA+ +KWP++F NA T V TA C Sbjct 1346 APSYRVKRMNIADCTEEAVVNAANARGKPGDGVCRAIFKKWPKSFENATTEVETAVMKPC 1405 Query 61 DETYIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFSAGKDR 120 +IHAVGP+F + E + L AY VA +N ISSVAIPLLSTGI++AG DR Sbjct 1406 HNKVVIHAVGPDFRKYTLEEATKLLQNAYHDVAKIVNEKGISSVAIPLLSTGIYAAGADR 1465 Query 121 VHQSLSHLLAAMDTTEARVTIYCRDKTWEQKIKTVLQNR-SATELVSDELQFEVNLTRVH 179 + SL L A+D T+A VTIYC DK WEQ+I ++ R TEL +++ + LTRVH Sbjct 1466 LDLSLRCLFTALDRTDADVTIYCLDKKWEQRIADAIRMREQVTELKDPDIEIDEGLTRVH 
1525 Query 180 PDSSLVGRPGYSTTDGTLYSYMEGTKFHQAALDMAEITTLWPRVQDANEHICLYALGETM 239 PDS L GYST G LYSY EGTKFHQ A D+AEI L+P VQ ANE ICLY LGE M Sbjct 1526 PDSCLKDHIGYSTQYGKLYSYFEGTKFHQTAKDIAEIRALFPDVQAANEQICLYTLGEPM 1585 Query 240 DNIRARCPVEDSDSSTPPKTVPCLCRYAMTPERVTRLRMHHTKDFVVCSSFQLPKYRIPG 299 ++IR +CPVEDS +S PPKT+PCLC YAMT ER+ R+R + + VCSSF LPKYRI Sbjct 1586 ESIREKCPVEDSPASAPPKTIPCLCMYAMTAERICRVRSNSVTNITVCSSFPLPKYRIKN 1645 Query 300 VQRVKCEKVMLFDAAPPASVSPVQYLTNQSETTISLSSFS-ITSDSSSLSTFPDLESAE- 357 VQ+++C KV+LF+ P + P + N+ E ++ + S + SS LS P L +AE Sbjct 1646 VQKIQCTKVVLFNPDVPPYI-PARVYINKDEPPVTPHTDSPPDTCSSRLSLTPTLSNAES 1704 Query 358 --------ELDHDSQSVRPALNEPDDH 376 E+D + S LNEP H Sbjct 1705 DIVSLTFSEIDSELSS----LNEPARH 1727 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Ockelbo virus] Sequence ID: P27283.2 Length: 2515 Range 1: 1348 to 1780 Score:363 bits(933), Expect:1e-110, Method:Compositional matrix adjust., Identities:197/436(45%), Positives:266/436(61%), Gaps:36/436(8%) Query 1 APAYAVKRADIATAIEDAVVNAANHRGQVGDGVCRAVARKWPQAFRNAATPVGTAKTVKC 60 AP+Y KR +IA E+AVVNAAN G+ G+GVCRA+ ++WP +F ++AT GTAK C Sbjct 1348 APSYRTKRENIADCQEEAVVNAANPLGRPGEGVCRAIYKRWPNSFTDSATETGTAKLTVC 1407 Query 61 DETYIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFSAGKDR 120 +IHAVGP+F EAE + L AY AVA +N +I SVAIPLLSTGI++AGKDR Sbjct 1408 HGKKVIHAVGPDFRKHPEAEALKLLQNAYHAVADLVNEHNIKSVAIPLLSTGIYAAGKDR 1467 Query 121 
VHQSLSHLLAAMDTTEARVTIYCRDKTWEQKIKTVLQ-NRSATELVSDELQFEVNLTRVH 179 + SL+ L A+D T+A VTIYC DK W+++I VLQ S TEL ++++ + L +H Sbjct 1468 LEVSLNCLTTALDRTDADVTIYCLDKKWKERIDAVLQLKESVTELKDEDMEIDDELVWIH 1527 Query 180 PDSSLVGRPGYSTTDGTLYSYMEGTKFHQAALDMAEITTLWPRVQDANEHICLYALGETM 239 PDS L GR G+STT G LYSY EGTKFHQAA DMAEI L+P Q++NE +C Y LGETM Sbjct 1528 PDSCLKGRKGFSTTKGKLYSYFEGTKFHQAAKDMAEIKVLFPNDQESNEQLCAYILGETM 1587 Query 240 DNIRARCPVEDSDSSTPPKTVPCLCRYAMTPERVTRLRMHHTKDFVVCSSFQLPKYRIPG 299 + IR +CPV+ + SS+PPKT+PCLC YAMTPERV RLR ++ K+ VCSS LPKY+I Sbjct 1588 EAIREKCPVDHNPSSSPPKTLPCLCMYAMTPERVHRLRSNNVKEVTVCSSTPLPKYKIKN 1647 Query 300 VQRVKCEKVMLFDAAPPASVSPVQYL-----------TNQSETTISLSSFSITSDSSSLS 348 VQ+V+C KV+LF+ PA V +Y+ ++ + +D++SL Sbjct 1648 VQKVQCTKVVLFNPHTPAFVPARKYIEVPEQPAAPPAQDEEAPEAVATPAPPAADNTSLD 1707 Query 349 TFPDLESAEELDHDSQSVRPALNEPDDHQPTPTAELATHP-------------------- 388 + + ++D S+ + D+ T ++ P Sbjct 1708 V---TDISLDMDDSSEGSLFSSFSGSDNSITCMDRWSSGPSSLDRRQVVVADVHAVQEPA 1764 Query 389 -VPPPRPNRARRLAAA 403 +PPPR + RLAAA Sbjct 1765 PIPPPRLKKMARLAAA 1780 >RecName: Full=Polyprotein nsP1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Middelburg virus] Sequence ID: P03318.2 Length: 995 Range 1: 1 to 385 Score:338 bits(868), Expect:5e-105, Method:Compositional matrix adjust., Identities:201/415(48%), Positives:253/415(60%), Gaps:34/415(8%) Query 82 DRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFSAGKDRVHQSLSHLLAAMDTTEARVTI 141 D DLAA YRAVA+ + ++ ++AIPLLSTG F+ GKDRV QSL+HL A+DTT+ VTI Sbjct 1 DADLAAVYRAVASLADE-TVRTMAIPLLSTGTFAGGKDRVLQSLNHLFTALDTTDVDVTI 59 Query 142 YCRDKTWEQKIKTVLQNRSATELVSDELQFEVNLTRVHPDSSLVGRPGYSTTDGTLYSYM 201 YCRDK+WE+KI+ + 
R+ATEL+ D+ LTRVHPDS LVGR G+ST DG L+SY+ Sbjct 60 YCRDKSWEKKIQEAIDMRTATELLDDDTTVMKELTRVHPDSCLVGRSGFSTVDGRLHSYL 119 Query 202 EGTKFHQAALDMAEITTLWPRVQDANEHICLYALGETMDNIRARCPVEDSDSSTPPKTVP 261 EGT+FHQ A+D+AE TLWPR ++ANE I Y LGE+M+ IR +CPV+D+DSS PP TVP Sbjct 120 EGTRFHQTAVDVAERPTLWPRREEANEQITHYVLGESMEAIRTKCPVDDTDSSAPPCTVP 179 Query 262 CLCRYAMTPERVTRLRMHHTKDFVVCSSFQLPKYRIPGVQRVKCEKVMLFDAAPPASVSP 321 CLCRYAMTPERV RLR K F VCSSF LPKY+IPGVQRV C VMLF+ PA VSP Sbjct 180 CLCRYAMTPERVHRLRAAQVKQFTVCSSFPLPKYKIPGVQRVACSAVMLFNHDVPALVSP 239 Query 322 VQYLTNQSETTISLSSFSITSDSSSLSTFPDLESAEELDHDSQSVRPALNEPDDHQPTPT 381 +Y + S S S+ DL+ D + + + P QP P Sbjct 240 RKYREPSISSESSSSGLSVF----------DLDIGS--DSEYEPMEPV-------QPEPL 280 Query 382 AELA-THPVPPPRPNRARRLAAARVQVQVEVHQPP---SNQPTKPIPAPRTSLRPVPAPR 437 +LA P R R +AA R P + P+PAPRT PV PR Sbjct 281 IDLAVVEETAPVRLERVAPVAAPR-----RARATPFTLEQRVVAPVPAPRTM--PVRPPR 333 Query 438 RYVPRPVVELPWPLETIDVEFGAPTEEESDITFGDFSASEWETISNSSXLGRAGA 492 R + P + D++ D+TFGDF A E+E ++ S+XL RAGA Sbjct 334 R--KKAATRTPERISFGDLDAECMAIINDDLTFGDFGAGEFERLT-SAXLDRAGA 385 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Sleeping disease virus] Sequence ID: Q8QL53.1 Length: 2593 Range 1: 1421 to 1759 Score:234 bits(597), Expect:6e-66, Method:Compositional matrix adjust., Identities:140/344(41%), Positives:181/344(52%), Gaps:24/344(6%) Query 1 APAYAVKRADIATAIEDAVVNAANHRGQVGDGVCRAVARKWPQAFRNAATPVGTAKTVKC 60 AP Y V +I TA E+ +VNAAN G+ GDGVC A+ + AF N A G A V+ Sbjct 1421 APGYRVLNKNIITAEEEVLVNAANSNGRPGDGVCGALYGAFGDAFPNGAIGAGNAVLVRG 1480 Query 61 
DETYIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFSAGKDR 120 E IIHA G +F E G R L AAYRA A + I+S AIPLLST IFS G++R Sbjct 1481 LEATIIHAAGADFREVDEETGARQLRAAYRAAATLVTANGITSAAIPLLSTHIFSNGRNR 1540 Query 121 VHQSLSHLLAAMDTTEARVTIYCRDKTWEQKIKTVLQNRS-----------------ATE 163 + QS L+ A DTTE VTIYC +I+ ++ + + A Sbjct 1541 LEQSFGALVEAFDTTECDVTIYCLANNMAARIQQLIDDHAREEFDEEVVVEEEEEHEANA 1600 Query 164 LVSDEL--QFEVNLTRVHPDSSLVGRPGYSTTDGTLYSYMEGTKFHQAALDMAEITTLWP 221 + E F V S+L GRPGYS T G S GTKFH+AA+ M+ I WP Sbjct 1601 MCDTETLSSFGDETVWVPKHSTLAGRPGYSATYGDRRSLFVGTKFHRAAVAMSSIEAAWP 1660 Query 222 RVQDANEHICLYALGETMDNIRARCPVEDSDSSTPPKTVPCLCRYAMTPERVTRLRMHHT 281 R ++AN + Y G+ + ++ CPV D PP ++PC C YAMTPERVT L+ Sbjct 1661 RTKEANAKLIEYIRGQHLVDVLKSCPVNDIPVGRPPSSLPCGCIYAMTPERVTVLKQRPQ 1720 Query 282 KDFVVCSSFQLPKYRIPGVQRVKCEKVMLFDAAPPASVSPVQYL 325 + FVVCS+F+LP I V +V+C AP PV+YL Sbjct 1721 EGFVVCSAFKLPLTNIQDVTKVECTV-----RAPAEEPRPVRYL 1759 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Salmon pancreas disease virus] Sequence ID: Q8JJX1.1 Length: 2601 Range 1: 1422 to 1745 Score:229 bits(585), Expect:2e-64, Method:Compositional matrix adjust., Identities:135/327(41%), Positives:175/327(53%), Gaps:25/327(7%) Query 1 APAYAVKRADIATAIEDAVVNAANHRGQVGDGVCRAVARKWPQAFRNAATPVGTAKTVKC 60 AP Y V +I TA E+ +VNAAN G+ GDGVC A+ + AF N A G A V+ Sbjct 1422 APGYRVLNRNIITAEEEVLVNAANSNGRPGDGVCGALYGAFGDAFPNGAIGAGNAVLVRG 1481 Query 61 DETYIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFSAGKDR 120 E IIHA G +F E G R L AAYRA A + I+S AIPLLST IFS G++R Sbjct 1482 
LEATIIHAAGADFREVDEETGARQLRAAYRAAATLVTANGITSAAIPLLSTHIFSNGRNR 1541 Query 121 VHQSLSHLLAAMDTTEARVTIYCRDKTWEQKIKTVLQNR--------------------- 159 + QS S L+ A DTTE VTIYC +I+ ++ Sbjct 1542 LEQSFSALVEAFDTTECDVTIYCLANNMAARIQQLIDAHAREEFDEEVVVEEEEEHEADA 1601 Query 160 -SATELVSDELQFEVNLTRVHPDSSLVGRPGYSTTDGTLYSYMEGTKFHQAALDMAEITT 218 S TE +S F V S+L GRPGYS G S GTKFH+AA+ M+ I Sbjct 1602 MSDTETLS---SFGDETVWVPKHSTLAGRPGYSAYYGDRRSLFVGTKFHRAAVAMSSIEA 1658 Query 219 LWPRVQDANEHICLYALGETMDNIRARCPVEDSDSSTPPKTVPCLCRYAMTPERVTRLRM 278 WP+ ++AN + Y G+ + ++ CPV+D PP ++PC C YAMTPERVT L+ Sbjct 1659 AWPKTKEANAKLIEYIRGQHLVDVLKSCPVDDIPVGRPPSSLPCGCIYAMTPERVTVLKQ 1718 Query 279 HHTKDFVVCSSFQLPKYRIPGVQRVKC 305 + FVVCS+F+LP I V +V+C Sbjct 1719 RPQEGFVVCSAFKLPLTNIQDVTKVEC 1745 >RecName: Full=Uncharacterized protein Saci_1252 [Sulfolobus acidocaldarius DSM 639] Sequence ID: Q4J9D2.1 Length: 181 Range 1: 15 to 132 Score:62.0 bits(149), Expect:5e-10, Method:Compositional matrix adjust., Identities:42/122(34%), Positives:56/122(45%), Gaps:16/122(13%) Query 6 VKRADIATAIEDAVVNAANHRGQVGDGVCRAVARKWPQAF---------RNAATPVGTAK 56 ++ DI DA+VNAAN G GV A+ R RN PVG Sbjct 15 LENGDITKVEADAIVNAANSYLSHGGGVALAIVRSGGYIIQEESDEYVRRNGPVPVGEVA 74 Query 57 TV---KCDETYIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGI 113 K Y+IHAVGP + EGD L +A R + + L +SS+A+P +STGI Sbjct 75 VTTAGKLKARYVIHAVGPRYG----IEGDDKLESAIRRSLEKADELKLSSIALPAISTGI 130 Query 114 FS 115 + Sbjct 131 YG 132 >RecName: Full=Uncharacterized protein SSO2899 [Saccharolobus solfataricus P2] Sequence ID: Q97UU4.1 Length: 177 Range 1: 14 to 177 Score:61.2 bits(147), Expect:8e-10, Method:Compositional matrix adjust., Identities:47/168(28%), Positives:77/168(45%), Gaps:21/168(12%) Query 8 RADIATAIEDAVVNAANHRGQVGDGVCRAVARKWPQAFRNAA---------TPVGTAKTV 58 + DI DA+VNAAN Q G GV A+ RK + + PVG Sbjct 14 KGDITEIEADAIVNAANSYLQHGGGVAYAIVRKGGYIIQKESDEYVKKFGPVPVGEVAVT 73 Query 59 ---KCDETYIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFS 115 K Y+IHAVGP + EG+ L +A + + LS+SS+A+P +STGI+ 
Sbjct 74 SAGKLKAKYVIHAVGPRYG----IEGEDKLESAIFKSLLKADELSLSSIAMPAISTGIYG 129 Query 116 AGKDRVHQSLSHLLAAMDTTEAR---VTIYCRDK--TWEQKIKTVLQN 158 + + ++++L R + +Y +D ++ ++L+N Sbjct 130 YPFEICARIMANVLKGYKPKTLRKVMICLYTKDAYDVFKSIFNSILKN 177 >RecName: Full=Uncharacterized protein STK_23830 [Sulfurisphaera tokodaii str. 7] Sequence ID: Q96XY5.1 Length: 182 Range 1: 7 to 122 Score:57.8 bits(138), Expect:2e-08, Method:Compositional matrix adjust., Identities:41/120(34%), Positives:57/120(47%), Gaps:16/120(13%) Query 8 RADIATAIEDAVVNAANHRGQVGDGVCRAVARKWPQAFRNAA-------TPV---GTAKT 57 + DI +A+VNAAN + G GV RA+ K + + PV G A T Sbjct 7 KGDITEIEAEAIVNAANSYLEHGGGVARAIVEKGGYIIQKESREYVRKYGPVPTGGVAVT 66 Query 58 V--KCDETYIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFS 115 K Y+IHAVGP + EG+ L A R + L +SS+A+P +STGI+ Sbjct 67 SAGKLKAKYVIHAVGPRYG----IEGEEKLEEAIRNALRKAEELKLSSIALPAISTGIYG 122 >RecName: Full=Uncharacterized protein PAE1111 [Pyrobaculum aerophilum str. IM2] Sequence ID: Q8ZXT3.1 Length: 182 Range 1: 14 to 129 Score:55.1 bits(131), Expect:1e-07, Method:Compositional matrix adjust., Identities:40/120(33%), Positives:53/120(44%), Gaps:16/120(13%) Query 8 RADIATAIEDAVVNAANHRGQVGDGVCRAVARKWPQAF---------RNAATPVGTAKTV 58 R DI DA+VNAAN + G GV A+ RK Q ++ PVG Sbjct 14 RGDITEVEADAIVNAANSYLEHGGGVAGAIVRKGGQVIQEESREWVRKHGPVPVGDVAVT 73 Query 59 ---KCDETYIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFS 115 + Y+IHAVGP E LA A + + L + S+A+P +STGIF Sbjct 74 SAGRLKAKYVIHAVGPRCG----VEPIEKLAEAVKNALLKAEELGLVSIALPAISTGIFG 129 >RecName: Full=Uncharacterized protein FN1951 [Fusobacterium nucleatum subsp. 
nucleatum ATCC 25586] Sequence ID: Q8RHQ2.1 Length: 175 Range 1: 20 to 124 Score:53.9 bits(128), Expect:2e-07, Method:Compositional matrix adjust., Identities:35/106(33%), Positives:50/106(47%), Gaps:9/106(8%) Query 17 DAVVNAANHRGQVGDGVCRAVARKWPQAFRNAATPVGTAKTVKCDET--------YIIHA 68 +A+VNAAN ++G GVC A+ + +G T + T YIIH Sbjct 20 EAIVNAANSSLEMGGGVCGAIFKAAGSELAQECKEIGGCNTGEAVITKGYNLPNKYIIHT 79 Query 69 VGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIF 114 VGP ++ E +R LA+AY N I +A P +STGI+ Sbjct 80 VGPRYSTGENREAER-LASAYYESLKLANEKGIRRIAFPSISTGIY 124 >RecName: Full=Uncharacterized protein TM_0508 [Thermotoga maritima MSB8] Sequence ID: Q9WYX8.1 Length: 599 Range 1: 431 to 556 Score:56.6 bits(135), Expect:3e-07, Method:Compositional matrix adjust., Identities:39/127(31%), Positives:57/127(44%), Gaps:13/127(10%) Query 6 VKRADIATAIEDAVVNAANHRGQVGDGVCRAVARKWPQAFRNAA---------TPVGTAK 56 + + DI DA+VNAAN + G GV A+ R + + P G A Sbjct 431 IVKGDITREEVDAIVNAANEYLKHGGGVAGAIVRAGGSVIQEESDRIVQERGRVPTGEAV 490 Query 57 TV---KCDETYIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGI 113 K Y+IH VGP + S E + A Y A+ + L + S+++P +STGI Sbjct 491 VTSAGKLKAKYVIHTVGPVWRGGSHGEDELLYKAVYNAL-LRAHELKLKSISMPAISTGI 549 Query 114 FSAGKDR 120 F K+R Sbjct 550 FGFPKER 556 >RecName: Full=Macro domain-containing protein DR_2288 [Deinococcus radiodurans R1 = ATCC 13939 = DSM 20539] Sequence ID: Q9RS39.1 Length: 170 Range 1: 7 to 129 Score:52.0 bits(123), Expect:1e-06, Method:Compositional matrix adjust., Identities:48/128(38%), Positives:59/128(46%), Gaps:20/128(15%) Query 8 RADIATAIEDAVVNAANHRGQVGDGV----CRAVARKWPQAFR-NAATPVGTAKTV---- 58 + DIA DAVV AAN + G GV RA + QA R TP GTA Sbjct 7 QGDIAHQPVDAVVTAANKQLMGGGGVDGVIHRAAGPRLLQAIRPIGGTPTGTAVITPAFD 66 Query 59 --KCDETYIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSIS----SVAIPLLSTG 112 + Y+IHAVGP + E + LA AYR E RL + SVA P +STG Sbjct 67 LERQGVKYVIHAVGPIWRGGQHGEAEL-LAGAYR----ESLRLGVENGCRSVAFPSISTG 121 Query 113 IFSAGKDR 120 ++ DR Sbjct 122 VYGYPLDR 129 >RecName: Full=Macro 
domain-containing protein MA_1614 [Methanosarcina acetivorans C2A] Sequence ID: Q8TQD0.1 Length: 195 Range 1: 35 to 192 Score:50.8 bits(120), Expect:4e-06, Method:Compositional matrix adjust., Identities:47/159(30%), Positives:71/159(44%), Gaps:12/159(7%) Query 10 DIATAIEDAVVNAANHRGQVGDGVCRAVARK-WPQAFRNAAT----PVGTAKTVK---CD 61 DI DA+VNAAN+ G GV A+ R P T P G AK K Sbjct 35 DITELKVDAIVNAANNTLLGGGGVDGAIHRAAGPGLLEECRTLNGCPTGEAKITKGYLLP 94 Query 62 ETYIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFSAGKDRV 121 Y+IH VGP + ++ E D LA+ YR + + ++A P +STG + +R Sbjct 95 AKYVIHTVGPIWQEGTKGE-DEFLASCYRKSLELARKYDVKTIAFPTISTGAYGFPSERA 153 Query 122 HQ-SLSHLLAAMDTTEA--RVTIYCRDKTWEQKIKTVLQ 157 + ++S + + E V + C +K + IK L+ Sbjct 154 ARIAVSQVKEFLKVNELPEIVFLVCYNKEACKNIKKALE 192 >RecName: Full=Macro domain-containing protein RSc0334 [Ralstonia pseudosolanacearum GMI1000] Sequence ID: Q8Y2K1.1 Length: 171 Range 1: 12 to 126 Score:49.7 bits(117), Expect:8e-06, Method:Compositional matrix adjust., Identities:39/116(34%), Positives:53/116(45%), Gaps:9/116(7%) Query 8 RADIATAIEDAVVNAANHRGQVGDGVCRAVARKWPQAFRNAATPVGTAKTVKCDET---- 63 RADI T DA+VNAAN G GV A+ R A + +T + T Sbjct 12 RADITTLACDAIVNAANSALLGGGGVDGAIHRAAGPELLEACRALHGCRTGQAKITPGFL 71 Query 64 ----YIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFS 115 YIIH VGP + + E LAA YR A + + ++A P +STG++ Sbjct 72 LPARYIIHTVGPIWRGGRQDEAAL-LAACYRNSLALAKQHDVRTIAFPCISTGVYG 126 >RecName: Full=Macro domain-containing protein VPA0103 [Vibrio parahaemolyticus RIMD 2210633] Sequence ID: Q87JZ5.1 Length: 170 Range 1: 3 to 126 Score:48.1 bits(113), Expect:2e-05, Method:Compositional matrix adjust., Identities:39/126(31%), Positives:59/126(46%), Gaps:15/126(11%) Query 3 AYAVKRADIATAIEDAVVNAANHRGQVGDGVCRAVARKWPQAFRNAATPVGTAKTVKC-- 60 A ++ + DI TA DA+VNAAN R G GV A+ R A NA V ++C Sbjct 3 AISLVQGDITTAHVDAIVNAANPRMLGGGGVDGAIHRAAGPALINACYAVDDVDGIRCPF 62 Query 61 -----------DETYIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLL 109 + Y+IHAVGP ++ ++ + L +AY+ SVA+P + Sbjct 
63 GDARITEAGNLNARYVIHAVGPIYDKFADPK--TVLESAYQRSLDLALANHCQSVALPAI 120 Query 110 STGIFS 115 S G++ Sbjct 121 SCGVYG 126 >RecName: Full=Macro domain-containing protein MM_0177 [Methanosarcina mazei Go1] Sequence ID: Q8Q0F9.1 Length: 187 Range 1: 26 to 183 Score:48.1 bits(113), Expect:3e-05, Method:Compositional matrix adjust., Identities:49/159(31%), Positives:67/159(42%), Gaps:12/159(7%) Query 9 ADIATAIEDAVVNAANHRGQVGDGVCRAVAR-KWPQAFRNAAT----PVGTAKTVK---C 60 DI DA+VNAAN+ G GV A+ R P T P G AK Sbjct 26 GDIVKMRVDAIVNAANNTLLGGGGVDGAIHRAAGPALLEECKTLNGCPTGEAKITSGYLL 85 Query 61 DETYIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFSAGKDR 120 YIIH VGP + + E D LA+ YR I ++A P +STG + +R Sbjct 86 PAKYIIHTVGPVWQGGEKGE-DELLASCYRKSLELARDYKIKTIAFPAISTGAYGFPSER 144 Query 121 VHQ-SLSHLLAAMDTTEARVTIY--CRDKTWEQKIKTVL 156 ++S + + E T+Y C +K + IK L Sbjct 145 AAGIAVSQVKEFLQKNEIPETVYLVCYNKDSCKSIKKAL 183 >RecName: Full=Macro domain-containing protein PG1779 [Porphyromonas gingivalis W83] Sequence ID: Q7MTZ7.1 Length: 164 Range 1: 6 to 127 Score:47.4 bits(111), Expect:4e-05, Method:Compositional matrix adjust., Identities:39/123(32%), Positives:56/123(45%), Gaps:9/123(7%) Query 6 VKRADIATAIEDAVVNAANHRGQVGDGVCRAVAR-KWPQAFRNAAT----PVGTAKTVKC 60 + DI DA+VNAANH G GV A+ R P+ T P G +K Sbjct 6 ITVGDITRFEGDAIVNAANHTLLGGGGVDGAIHRAAGPELLEECRTLNGCPTGESKITGG 65 Query 61 DET---YIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFSAG 117 Y+IH VGP ++ E + LA+ YR + + S+A P +STG++ Sbjct 66 YNLPAQYVIHTVGPVWHGGQHGEPEL-LASCYRTSLSIALDKGLKSIAFPCISTGVYRYP 124 Query 118 KDR 120 KD+ Sbjct 125 KDQ 127 >RecName: Full=O-acetyl-ADP-ribose deacetylase 1; AltName: Full=Regulator of RNase III activity 1 [Pantoea vagans C9-1] Sequence ID: E1SDF1.1 Length: 171 Range 1: 6 to 159 Score:46.6 bits(109), Expect:9e-05, Method:Compositional matrix adjust., Identities:47/156(30%), Positives:67/156(42%), Gaps:18/156(11%) Query 6 VKRADIATAIEDAVVNAANHRGQVGDGVCRAVARKWP-------QAFRN--AATPVGTAK 56 V + DI +A+VNAAN G GV A+ R Q RN VG A Sbjct 6 
VIQGDITKVSAEAIVNAANSSLLGGGGVDGAIHRAGGPVILAECQLIRNRQGGCKVGDAV 65 Query 57 TVKCDET---YIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGI 113 Y+IH VGP +++ E D L AY++ ++ I +V+ P +STGI Sbjct 66 ITGAGNLPADYVIHTVGPRWSDGRHDE-DALLKRAYQSCFKLVDYHGIKTVSFPNISTGI 124 Query 114 FSAGKDRVH----QSLSHLLAAMDTTEARVTIYCRD 145 + K+R + H +A T E V + C D Sbjct 125 YGFPKERAATIALDVIKHCIAENRTLE-NVNLVCFD 159 >RecName: Full=Macro domain-containing protein CT2219 [Chlorobaculum tepidum TLS] Sequence ID: Q8KAE4.1 Length: 172 Range 1: 11 to 125 Score:46.2 bits(108), Expect:1e-04, Method:Compositional matrix adjust., Identities:41/122(34%), Positives:59/122(48%), Gaps:21/122(17%) Query 8 RADIATAIEDAVVNAANHRGQVGDGVCRAVAR----KWPQAFRN-AATPVGTAKTVKCDE 62 +ADI + DA+VNAAN G GV A+ R K +A R G AK K Sbjct 11 KADITSLTVDAIVNAANTSLLGGGGVDGAIHRAAGPKLLEACRELGGCLTGEAKITKGYR 70 Query 63 ---TYIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSIS------SVAIPLLSTGI 113 T++IH VGP ++ + E + LA+ YR N L ++ ++A P +STGI Sbjct 71 LPATFVIHTVGPVWHGGNHGEAEL-LASCYR------NSLKLAIEHHCRTIAFPSISTGI 123 Query 114 FS 115 + Sbjct 124 YG 125 >RecName: Full=Non-structural polyprotein p200; Short=p200; Contains: RecName: Full=Protease/methyltransferase p150; Short=p150; Contains: RecName: Full=RNA-directed RNA polymerase p90; Short=p90 [Rubella virus strain RN-UK86] Sequence ID: Q8BCR0.1 Length: 2116 Range 1: 836 to 967 Score:48.1 bits(113), Expect:2e-04, Method:Compositional matrix adjust., Identities:48/135(36%), Positives:60/135(44%), Gaps:14/135(10%) Query 19 VVNAANHRGQVGDGVCRAV-----ARKWPQAFRNAATPVGTAKTV---KCDETYIIHAVG 70 VVNAAN G GVC A+ A R A P G A C T+IIHAV Sbjct 836 VVNAANEGLLAGSGVCGAIFANATAALAADCRRLAPCPTGEAVATPGHGCGYTHIIHAVA 895 Query 71 PNFNNTSEA--EGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFSAGKDRVHQSLSHL 128 P A EG+ L AYR++ A + VA PLL G++ +SL Sbjct 896 PRRPRDPAALEEGEALLERAYRSIVALAAARRWACVACPLLGAGVYGWS---AAESLRAA 952 Query 129 LAAMDTTEA-RVTIY 142 LAA T A RV+++ Sbjct 953 LAATRTEPAERVSLH 967 >RecName: Full=ADP-ribose glycohydrolase MACROD2; AltName: Full=MACRO 
domain-containing protein 2; AltName: Full=O-acetyl-ADP-ribose deacetylase MACROD2; AltName: Full=[Protein ADP-ribosylaspartate] hydrolase MACROD2; AltName: Full=[Protein ADP-ribosylglutamate] hydrolase MACROD2 [Homo sapiens] Sequence ID: A1Z1Q3.2 Length: 425 Range 1: 76 to 191 Score:47.0 bits(110), Expect:2e-04, Method:Compositional matrix adjust., Identities:39/117(33%), Positives:49/117(41%), Gaps:10/117(8%) Query 8 RADIATAIEDAVVNAANHRGQVGDGVCRAVARKWPQAF----RN-AATPVGTAKTVKCD- 61 R DI DA+VNAAN G GV + R RN G AK + C Sbjct 76 RGDITLLEVDAIVNAANASLLGGGGVDGCIHRAAGPCLLAECRNLNGCDTGHAK-ITCGY 134 Query 62 ---ETYIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFS 115 Y+IH VGP DLA Y++ + +I SVA P +STGI+ Sbjct 135 DLPAKYVIHTVGPIARGHINGSHKEDLANCYKSSLKLVKENNIRSVAFPCISTGIYG 191 >RecName: Full=O-acetyl-ADP-ribose deacetylase 2; AltName: Full=Regulator of RNase III activity 2 [Pantoea vagans C9-1] Sequence ID: E1PL40.1 Length: 171 Range 1: 6 to 159 Score:44.7 bits(104), Expect:3e-04, Method:Compositional matrix adjust., Identities:44/156(28%), Positives:65/156(41%), Gaps:18/156(11%) Query 6 VKRADIATAIEDAVVNAANHRGQVGDGVCRAVARKWP-------QAFRN--AATPVGTAK 56 V + DI +A++N AN G GV A+ R QA R+ VG A Sbjct 6 VIQGDITNIASEAIINVANSSLLGGGGVDGAIHRAGGPVILAECQAIRSRQGGCKVGEAV 65 Query 57 TVKCDET---YIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGI 113 Y+IH VGP +++ E D L + Y + + I +V+ P +STGI Sbjct 66 ITGAGTLPADYVIHTVGPRWSDGRHNE-DTQLKSVYLSCFKLVGHHGIKTVSFPNISTGI 124 Query 114 FSAGKDRVH----QSLSHLLAAMDTTEARVTIYCRD 145 + K R + H +A T E +V + C D Sbjct 125 YGFPKKRAAAIALDVIKHCIAENRTIE-KVNLVCFD 159 >RecName: Full=Non-structural polyprotein p200; Short=p200; Contains: RecName: Full=Protease/methyltransferase p150; Short=p150; Contains: RecName: Full=RNA-directed RNA polymerase p90; Short=p90 [Rubella virus strain TO-336] Sequence ID: Q99IE5.1 Length: 2116 Range 1: 836 to 942 Score:47.0 bits(110), Expect:4e-04, Method:Compositional matrix adjust., Identities:39/107(36%), Positives:47/107(43%), Gaps:10/107(9%) 
Query 19 VVNAANHRGQVGDGVCRAV-----ARKWPQAFRNAATPVGTAKTV---KCDETYIIHAVG 70 VVNAAN G GVC A+ A R A P G A C T+IIHAV Sbjct 836 VVNAANEGLLAGSGVCGAIFANATAALAADCRRLAPCPTGEAVATPGHGCGYTHIIHAVA 895 Query 71 PNFNNTSEA--EGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFS 115 P A EG+ L AYR++ A + VA PLL G++ Sbjct 896 PRRPRDPAALEEGEALLERAYRSIVALAAARRWACVACPLLGAGVYG 942 >RecName: Full=Non-structural polyprotein p200; Short=p200; Contains: RecName: Full=Protease/methyltransferase p150; Short=p150; Contains: RecName: Full=RNA-directed RNA polymerase p90; Short=p90 [Rubella virus strain TO-336 vaccine] Sequence ID: Q99IE7.1 Length: 2116 Range 1: 836 to 942 Score:46.6 bits(109), Expect:4e-04, Method:Compositional matrix adjust., Identities:39/107(36%), Positives:47/107(43%), Gaps:10/107(9%) Query 19 VVNAANHRGQVGDGVCRAV-----ARKWPQAFRNAATPVGTAKTV---KCDETYIIHAVG 70 VVNAAN G GVC A+ A R A P G A C T+IIHAV Sbjct 836 VVNAANEGLLAGSGVCGAIFANATAALAADCRRLAPCPTGEAVATPGHGCGYTHIIHAVA 895 Query 71 PNFNNTSEA--EGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFS 115 P A EG+ L AYR++ A + VA PLL G++ Sbjct 896 PRRPRDPAALEEGEALLERAYRSIVALAAARRWACVACPLLGAGVYG 942 >RecName: Full=Non-structural polyprotein p200; Short=p200; Contains: RecName: Full=Protease/methyltransferase p150; Short=p150; Contains: RecName: Full=RNA-directed RNA polymerase p90; Short=p90 [Rubella virus strain Therien] Sequence ID: P13889.5 Length: 2116 Range 1: 836 to 942 Score:46.6 bits(109), Expect:5e-04, Method:Compositional matrix adjust., Identities:39/107(36%), Positives:47/107(43%), Gaps:10/107(9%) Query 19 VVNAANHRGQVGDGVCRAV-----ARKWPQAFRNAATPVGTAKTV---KCDETYIIHAVG 70 VVNAAN G GVC A+ A R A P G A C T+IIHAV Sbjct 836 VVNAANEGLLAGSGVCGAIFANATAALAANCRRLAPCPTGEAVATPGHGCGYTHIIHAVA 895 Query 71 PNFNNTSEA--EGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFS 115 P A EG+ L AYR++ A + VA PLL G++ Sbjct 896 PRRPRDPAALEEGEALLERAYRSIVALAAARRWACVACPLLGAGVYG 942 >RecName: Full=ADP-ribose glycohydrolase MACROD2; AltName: Full=MACRO domain-containing protein 2; AltName: Full=O-acetyl-ADP-ribose 
deacetylase MACROD2; AltName: Full=[Protein ADP-ribosylaspartate] hydrolase MACROD2; AltName: Full=[Protein ADP-ribosylglutamate] hydrolase MACROD2 [Mus musculus] Sequence ID: Q3UYG8.1 Length: 475 Range 1: 76 to 191 Score:46.2 bits(108), Expect:5e-04, Method:Compositional matrix adjust., Identities:38/117(32%), Positives:49/117(41%), Gaps:10/117(8%) Query 8 RADIATAIEDAVVNAANHRGQVGDGVCRAVARKWPQAF----RN-AATPVGTAKTVKCD- 61 R DI DA+VNAAN G GV + R RN G AK + C Sbjct 76 RGDITLLEVDAIVNAANASLLGGGGVDGCIHRAAGPCLLAECRNLNGCETGHAK-ITCGY 134 Query 62 ---ETYIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFS 115 Y+IH VGP DLA Y++ + ++ SVA P +STGI+ Sbjct 135 DLPAKYVIHTVGPIARGHINGSHKEDLANCYQSSLKLVKENNLRSVAFPCISTGIYG 191 >RecName: Full=Non-structural polyprotein p200; Short=p200; Contains: RecName: Full=Protease/methyltransferase p150; Short=p150; Contains: RecName: Full=RNA-directed RNA polymerase p90; Short=p90 [Rubella virus strain Cendehill] Sequence ID: Q9J6K9.2 Length: 2116 Range 1: 836 to 942 Score:46.2 bits(108), Expect:6e-04, Method:Compositional matrix adjust., Identities:39/107(36%), Positives:47/107(43%), Gaps:10/107(9%) Query 19 VVNAANHRGQVGDGVCRAV-----ARKWPQAFRNAATPVGTAKTV---KCDETYIIHAVG 70 VVNAAN G GVC A+ A R A P G A C T+IIHAV Sbjct 836 VVNAANEGLLAGSGVCGAIFANATAALAADCRRLAPCPTGEAVATPGHGCGYTHIIHAVA 895 Query 71 PNFNNTSEA--EGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFS 115 P A EG+ L AYR++ A + VA PLL G++ Sbjct 896 PRRPRDPAALEEGEALLERAYRSIVALAAARRWAYVACPLLGAGVYG 942 >RecName: Full=Macro domain-containing protein in non 5'region; AltName: Full=ORF1 [Streptomyces griseus] Sequence ID: Q9KHE2.1 Length: 177 Range 1: 6 to 132 Score:43.9 bits(102), Expect:8e-04, Method:Compositional matrix adjust., Identities:41/131(31%), Positives:55/131(41%), Gaps:20/131(15%) Query 1 APAYAVKRADIATAIEDAVVNAANHRGQVGDGVCRAVARKWP----------QAFR-NAA 49 +P + R DI D +VNAAN G GV A+ R+ +A R Sbjct 6 SPVVRLVRGDITDQSVDVIVNAANSSLLGGGGVDGAIHRRGGPDILAACRELRASRYGKG 65 Query 50 
TPVGTAKTV---KCDETYIIHAVGPNFNNTSEAEGDRD--LAAAYRAVAAEINRLSISSV 104 P G A + D +I+H VGP F+ DR LA+ YR L S+ Sbjct 66 LPTGQAVATTAGRLDARWIVHTVGPVFSGAQ----DRSALLASCYRESLRLAAELGARSI 121 Query 105 AIPLLSTGIFS 115 A P +STGI+ Sbjct 122 AFPAISTGIYG 132 >RecName: Full=Macro domain-containing protein LA_4133 [Leptospira interrogans serovar Lai str. 56601] Sequence ID: Q8EYT0.1 Length: 175 Range 1: 9 to 160 Score:43.5 bits(101), Expect:0.001, Method:Compositional matrix adjust., Identities:45/154(29%), Positives:67/154(43%), Gaps:18/154(11%) Query 8 RADIATAIEDAVVNAANHRGQVGDGVCRAVAR-KWPQAF--------RNAATPVGTAKTV 58 + DI DA+VNAAN G GV A+ R P+ + VG A Sbjct 9 KEDITQLEVDAIVNAANSSLLGGGGVDGAIHRAGGPEILEECYKIREKQGECKVGEAVIT 68 Query 59 ---KCDETYIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFS 115 + + +IIH VGP ++ ++ E D L+ AY+ S+ ++A P +STGI+ Sbjct 69 TAGRLNAKFIIHTVGPIWSGGNKNE-DELLSNAYKNSLLLAKNHSLKTIAFPNISTGIYH 127 Query 116 AGKDRVH----QSLSHLLAAMDTTEARVTIYCRD 145 K+R QS++ L D V C D Sbjct 128 FPKERAAKIAIQSVTEFL-KQDNQIQTVFFVCFD 160 >RecName: Full=Macro domain-containing protein LIC_13295 [Leptospira interrogans serovar Copenhageni str. 
Fiocruz L1-130] Sequence ID: Q72M93.1 Length: 175 Range 1: 9 to 160 Score:43.5 bits(101), Expect:0.001, Method:Compositional matrix adjust., Identities:45/154(29%), Positives:67/154(43%), Gaps:18/154(11%) Query 8 RADIATAIEDAVVNAANHRGQVGDGVCRAVAR-KWPQAF--------RNAATPVGTAKTV 58 + DI DA+VNAAN G GV A+ R P+ + VG A Sbjct 9 KEDITQLEVDAIVNAANSSLLGGGGVDGAIHRAGGPEILEECYKIREKQGECKVGEAVIT 68 Query 59 ---KCDETYIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFS 115 + + +IIH VGP ++ ++ E D L+ AY+ S+ ++A P +STGI+ Sbjct 69 TAGRLNAKFIIHTVGPIWSGGNKNE-DELLSNAYKNSLLLAKNHSLKTIAFPNISTGIYH 127 Query 116 AGKDRVH----QSLSHLLAAMDTTEARVTIYCRD 145 K+R QS++ L D V C D Sbjct 128 FPKERAAKIAIQSVTKFL-KQDNQIQTVFFVCFD 160 >RecName: Full=ADP-ribose glycohydrolase MACROD1; AltName: Full=MACRO domain-containing protein 1; AltName: Full=O-acetyl-ADP-ribose deacetylase MACROD1; AltName: Full=Protein LRP16; AltName: Full=[Protein ADP-ribosylaspartate] hydrolase MACROD1; AltName: Full=[Protein ADP-ribosylglutamate] hydrolase MACROD1 [Homo sapiens] Sequence ID: Q9BQ69.2 Length: 325 Range 1: 158 to 273 Score:44.3 bits(103), Expect:0.001, Method:Compositional matrix adjust., Identities:35/116(30%), Positives:49/116(42%), Gaps:8/116(6%) Query 8 RADIATAIEDAVVNAANHRGQVGDGVCRAVARKWPQAFRNAATPVGTAKTVKCDET---- 63 R+DI DA+VNAAN G GV + R + + + KT K T Sbjct 158 RSDITKLEVDAIVNAANSSLLGGGGVDGCIHRAAGPLLTDECRTLQSCKTGKAKITGGYR 217 Query 64 ----YIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFS 115 Y+IH VGP A +L + Y + + + SVA P +STG+F Sbjct 218 LPAKYVIHTVGPIAYGEPSASQAAELRSCYLSSLDLLLEHRLRSVAFPCISTGVFG 273 >RecName: Full=Uncharacterized protein PH1513 [Pyrococcus horikoshii OT3] Sequence ID: O59182.1 Length: 190 Range 1: 5 to 137 Score:43.1 bits(100), Expect:0.001, Method:Compositional matrix adjust., Identities:33/135(24%), Positives:51/135(37%), Gaps:25/135(18%) Query 4 YAVKRADIATAIEDAVVNAANHRGQVGDGVCRAVAR-----------------------K 40 + + R DI +A+VNAAN + G GV A+A+ Sbjct 5 
FKIVRGDITKFRAEAIVNAANKYLEHGGGVAYAIAKAASGDVSEYTRISKEEMRRQLGKD 64 Query 41 WPQAFRNAATPVGTAKTVKCDETYIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLS 100 W + TP K + Y+IH VGP + + L A + + L Sbjct 65 WIEHGEVVVTP--PMKLKENGVKYVIHTVGPYCGGVWSKDKEEKLKLAILGALKKADELG 122 Query 101 ISSVAIPLLSTGIFS 115 + S+A P +S GI+ Sbjct 123 VKSIAFPAISAGIYG 137 >RecName: Full=Macro domain-containing protein SCO6450 [Streptomyces coelicolor A3(2)] Sequence ID: Q9ZBG3.1 Length: 169 Range 1: 8 to 126 Score:42.7 bits(99), Expect:0.002, Method:Compositional matrix adjust., Identities:39/121(32%), Positives:56/121(46%), Gaps:16/121(13%) Query 8 RADIATAIEDAVVNAANHRGQVGDGVCRAVARKWPQAF-----RNAATPVG----TAKTV 58 + DI DA+VNAAN G GV A+ R+ A R A +G T + V Sbjct 8 QGDITRQSADAIVNAANSSLLGGGGVDGAIHRRGGPAILAECRRLRAGHLGKGLPTGRAV 67 Query 59 K-----CDETYIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGI 113 D ++IH VGP ++ T + G LA+ YR + L +VA P +STG+ Sbjct 68 ATTAGDLDARWVIHTVGPVWSATEDRSGL--LASCYRESLRTADELGARTVAFPAISTGV 125 Query 114 F 114 + Sbjct 126 Y 126 >RecName: Full=Non-structural polyprotein p200; Short=p200; Contains: RecName: Full=Protease/methyltransferase p150; Short=p150; Contains: RecName: Full=RNA-directed RNA polymerase p90; Short=p90 [Rubella virus strain BRD1] Sequence ID: Q6X2U4.1 Length: 2116 Range 1: 836 to 942 Score:44.7 bits(104), Expect:0.002, Method:Compositional matrix adjust., Identities:39/107(36%), Positives:46/107(42%), Gaps:10/107(9%) Query 19 VVNAANHRGQVGDGVCRAV-----ARKWPQAFRNAATPVGTAKTV---KCDETYIIHAVG 70 VVNAAN G GVC A+ A R A P G A C T+IIHAV Sbjct 836 VVNAANEGLLAGSGVCGAIFASAAATLAEDCRRLAPCPTGEAVATPGHGCGYTHIIHAVA 895 Query 71 PNFNNTSEA--EGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFS 115 P A + + L AYR+V A + VA PLL GI+ Sbjct 896 PRRPQDPAALEQSEALLERAYRSVVALAAARRWACVACPLLGAGIYG 942 >RecName: Full=Macro domain-containing protein in sno 5'region; AltName: Full=ORF7 [Streptomyces nogalater] Sequence ID: Q9EYI6.1 Length: 181 Range 1: 8 to 126 Score:42.4 bits(98), Expect:0.002, Method:Compositional matrix adjust., 
Identities:40/123(33%), Positives:54/123(43%), Gaps:20/123(16%) Query 8 RADIATAIEDAVVNAANHRGQVGDGVCRAVARKWP-------QAFR----NAATPVGTAK 56 + DI DA+VNAAN G GV A+ R+ +A R P G A Sbjct 8 QGDITRQHADALVNAANSSLLGGGGVDGAIHRRGGPAILAECRALRASRYGEGLPTGRAV 67 Query 57 TVKC---DETYIIHAVGPNFNNTSEAEGDRD--LAAAYRAVAAEINRLSISSVAIPLLST 111 D ++IH VGP +++T DR LA+ YR L +VA P LST Sbjct 68 ATTAGDLDARWVIHTVGPVWSSTE----DRSDLLASCYRESLRLAGELGARTVAFPALST 123 Query 112 GIF 114 G++ Sbjct 124 GVY 126 >RecName: Full=ADP-ribose glycohydrolase MACROD2; AltName: Full=MACRO domain-containing protein 2; AltName: Full=O-acetyl-ADP-ribose deacetylase MACROD2; AltName: Full=[Protein ADP-ribosylaspartate] hydrolase MACROD2; AltName: Full=[Protein ADP-ribosylglutamate] hydrolase MACROD2 [Xenopus laevis] Sequence ID: Q6PAV8.1 Length: 418 Range 1: 74 to 189 Score:43.9 bits(102), Expect:0.002, Method:Compositional matrix adjust., Identities:33/116(28%), Positives:48/116(41%), Gaps:8/116(6%) Query 8 RADIATAIEDAVVNAANHRGQVGDGVCRAVARKWPQAFRNAATPVGTAKTVKCDET---- 63 + DI DA+VNAAN G GV + R + +G +T + T Sbjct 74 KGDITQLEVDAIVNAANTSLLGGGGVDGCIHRASGPSLLAECRELGGCETGQAKITCGYE 133 Query 64 ----YIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFS 115 Y+IH VGP +DLA+ Y + I ++A P +STGI+ Sbjct 134 LPAKYVIHTVGPIARGHITPNHKQDLASCYNSSLTLATENDIRTIAFPCISTGIYG 189 >RecName: Full=Uncharacterized protein PYRAB06560 [Pyrococcus abyssi GE5] Sequence ID: Q9V0Y3.2 Length: 183 Range 1: 2 to 178 Score:42.4 bits(98), Expect:0.003, Method:Compositional matrix adjust., Identities:41/180(23%), Positives:70/180(38%), Gaps:31/180(17%) Query 4 YAVKRADIATAIEDAVVNAANHRGQVGDGVCRAVA-----------------------RK 40 + V DI +A+VNAAN + G GV A+A R Sbjct 2 FRVVHGDITRFKAEAIVNAANKYLEHGGGVAYAIAKAASGDVSEYIRISKEEMRKQIGRD 61 Query 41 WPQAFRNAATPVGTAKTVKCDETYIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLS 100 W + TP K Y+IH VGP + + + L A + + L Sbjct 62 WIEHGEVVVTP--PLNLAKNGVKYVIHTVGPYCGGKWDEDKRKKLELAILGALKKADELG 119 Query 101 ISSVAIPLLSTGIFSAGKDRVHQSLS-----HLLAAMDTTEARVTIYCRDKTWEQKIKTV 
155 + S+A P +S GI+ + V ++ L +A + T+ + +Y ++ +E +K + Sbjct 120 VRSIAFPAISAGIYGCPLEEVVKTFKLVVNEFLKSAKNVTDVYLVLYS-ERDYEVALKVL 178 >RecName: Full=Macro domain-containing protein PA3693 [Pseudomonas aeruginosa PAO1] Sequence ID: Q9HXU7.1 Length: 173 Range 1: 6 to 122 Score:42.4 bits(98), Expect:0.003, Method:Compositional matrix adjust., Identities:37/118(31%), Positives:51/118(43%), Gaps:9/118(7%) Query 6 VKRADIATAIEDAVVNAANHRGQVGDGVCRAVARKWPQAFRNAATPVGTAKTVKCDET-- 63 V + DI DA+VNAAN G GV A+ R A + KT + T Sbjct 6 VWQGDITRLAVDAIVNAANSSLLGGGGVDGAIHRAAGAELVAACRLLHGCKTGEAKITRG 65 Query 64 ------YIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFS 115 ++IH VGP + E + LA+ YR A + +SVA P +S GI+ Sbjct 66 FRLPAAHVIHTVGPVWRGGDNGEAEL-LASCYRRSLALAEQAGAASVAFPAISCGIYG 122 >RecName: Full=Non-structural polyprotein p200; Short=p200; Contains: RecName: Full=Protease/methyltransferase p150; Short=p150; Contains: RecName: Full=RNA-directed RNA polymerase p90; Short=p90 [Rubella virus strain M33] Sequence ID: Q86500.2 Length: 2116 Range 1: 836 to 942 Score:44.3 bits(103), Expect:0.003, Method:Compositional matrix adjust., Identities:39/107(36%), Positives:48/107(44%), Gaps:10/107(9%) Query 19 VVNAANHRGQVGDGVCRAV-----ARKWPQAFRNAATPVGTAKTV---KCDETYIIHAVG 70 VVNAAN G GVC A+ A R A P+G A C T+IIHAV Sbjct 836 VVNAANEGLLAGSGVCGAIFANATAALAADCRRLAPCPIGEAVATPGHGCGYTHIIHAVA 895 Query 71 PNFNNTSEA--EGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFS 115 P A EG+ L AYR++ A + VA PLL G++ Sbjct 896 PRRPRDPAALEEGEALLERAYRSIVALAAARRWARVACPLLGAGVYG 942 >RecName: Full=Non-structural polyprotein p200; Short=p200; Contains: RecName: Full=Protease/methyltransferase p150; Short=p150; Contains: RecName: Full=RNA-directed RNA polymerase p90; Short=p90 [Rubella virus vaccine strain RA27/3] Sequence ID: O40955.1 Length: 2116 Range 1: 836 to 967 Score:43.9 bits(102), Expect:0.003, Method:Compositional matrix adjust., Identities:48/135(36%), Positives:60/135(44%), Gaps:14/135(10%) Query 19 
VVNAANHRGQVGDGVCRAV-----ARKWPQAFRNAATPVGTAKTV---KCDETYIIHAVG 70 VVNAAN G GVC A+ A R A P G A C T+IIHAV Sbjct 836 VVNAANEGLLAGSGVCGAIFANATAALAADCRRLAPCPTGEAVATPGHGCGYTHIIHAVA 895 Query 71 PNFNNTSEA--EGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFSAGKDRVHQSLSHL 128 P A EG+ L AYR++ A + VA PLL G++ +SL Sbjct 896 PRRPRDPAALEEGEALLERAYRSIVALAAARRWARVACPLLGAGVYGWS---AAESLRAA 952 Query 129 LAAMDTTEA-RVTIY 142 LAA T A RV+++ Sbjct 953 LAATRTEPAERVSLH 967 >RecName: Full=Protein mono-ADP-ribosyltransferase PARP9; AltName: Full=ADP-ribosyltransferase diphtheria toxin-like 9; Short=ARTD9; AltName: Full=B aggressive lymphoma protein homolog; AltName: Full=Poly [ADP-ribose] polymerase 9; Short=PARP-9 [Mus musculus] Sequence ID: Q8CAS9.2 Length: 866 Range 1: 123 to 246 Score:43.5 bits(101), Expect:0.004, Method:Compositional matrix adjust., Identities:36/126(29%), Positives:54/126(42%), Gaps:18/126(14%) Query 5 AVKRADIATAIEDAVVNAANHRGQVGDGVCRAVARKWPQAFRN--------------AAT 50 +V + D+ + DAVVNAAN G G+ ++ + + Sbjct 123 SVWKDDLTRHVVDAVVNAANENLLHGSGLAGSLVKTGGFEIQEESKRIIANVGKISVGGI 182 Query 51 PVGTAKTVKCDETYIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINR--LSISSVAIPL 108 + A + C IIHAVGP + T+ L A R + + + L I +VAIP Sbjct 183 AITGAGRLPCH--LIIHAVGPRWTVTNSQTAIELLKFAIRNILDYVTKYDLRIKTVAIPA 240 Query 109 LSTGIF 114 LS+GIF Sbjct 241 LSSGIF 246 >RecName: Full=ADP-ribose glycohydrolase MACROD1; AltName: Full=MACRO domain-containing protein 1; AltName: Full=O-acetyl-ADP-ribose deacetylase MACROD1; AltName: Full=Protein LRP16; AltName: Full=[Protein ADP-ribosylaspartate] hydrolase MACROD1; AltName: Full=[Protein ADP-ribosylglutamate] hydrolase MACROD1 [Mus musculus] Sequence ID: Q922B1.2 Length: 323 Range 1: 153 to 271 Score:42.0 bits(97), Expect:0.007, Method:Compositional matrix adjust., Identities:34/119(29%), Positives:49/119(41%), Gaps:8/119(6%) Query 5 AVKRADIATAIEDAVVNAANHRGQVGDGVCRAVARKWPQAFRNAATPVGTAKTVKCDET- 63 ++ R DI DA+VNAAN G GV + R + + +T K T Sbjct 153 SLYRGDITKLEVDAIVNAANSSLLGGGGVDGCIHRAAGSLLTDECRTLQNCETGKAKITC 
212 Query 64 -------YIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFS 115 Y+IH VGP A +L + Y + + + SVA P +STG+F Sbjct 213 GYRLPAKYVIHTVGPIAVGQPTASQAAELRSCYLSSLDLLLEHRLRSVAFPCISTGVFG 271 >RecName: Full=Non-structural polyprotein p200; Short=p200; Contains: RecName: Full=Protease/methyltransferase p150; Short=p150; Contains: RecName: Full=RNA-directed RNA polymerase p90; Short=p90 [Rubella virus strain BRDII] Sequence ID: Q6X2U2.1 Length: 2116 Range 1: 836 to 942 Score:42.4 bits(98), Expect:0.009, Method:Compositional matrix adjust., Identities:37/107(35%), Positives:45/107(42%), Gaps:10/107(9%) Query 19 VVNAANHRGQVGDGVCRAV-----ARKWPQAFRNAATPVGTAKTV---KCDETYIIHAVG 70 VVNAAN G GVC A+ A R A P G A C +IIHAV Sbjct 836 VVNAANEGLLAGSGVCGAIFASAAASLAEDCRRLAPCPTGEAVATPGHGCGYAHIIHAVA 895 Query 71 PNFNNTSEA--EGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFS 115 P A + + L AYR++ A + VA PLL GI+ Sbjct 896 PRRPQDPAALEQSEALLERAYRSIVALAAARRWTCVACPLLGAGIYG 942 >RecName: Full=ADP-ribose glycohydrolase MACROD1; AltName: Full=MACRO domain-containing protein 1; AltName: Full=O-acetyl-ADP-ribose deacetylase MACROD1; AltName: Full=Protein LRP16; AltName: Full=[Protein ADP-ribosylaspartate] hydrolase MACROD1; AltName: Full=[Protein ADP-ribosylglutamate] hydrolase MACROD1 [Bos taurus] Sequence ID: Q2KHU5.1 Length: 325 Range 1: 158 to 273 Score:41.6 bits(96), Expect:0.010, Method:Compositional matrix adjust., Identities:33/116(28%), Positives:47/116(40%), Gaps:8/116(6%) Query 8 RADIATAIEDAVVNAANHRGQVGDGVCRAVARKWPQAFRNAATPVGTAKTVKCDET---- 63 R DI DA+VNAAN G GV + R + + +T K T Sbjct 158 RGDITKLEVDAIVNAANSSLLGGGGVDGCIHRAAGPLLTDECRTLQNCETGKAKITCGYR 217 Query 64 ----YIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFS 115 Y+IH VGP + A +L + Y + + + S A P +STG+F Sbjct 218 LPAKYVIHTVGPIAHGEPSASQAAELRSCYLSSLDLLLEHRLRSAAFPCISTGVFG 273 >RecName: Full=Uncharacterized protein TV0719 [Thermoplasma volcanium GSS1] Sequence ID: Q97AU0.1 Length: 186 Range 1: 13 to 134 Score:40.4 bits(93), Expect:0.010, Method:Compositional 
matrix adjust., Identities:35/127(28%), Positives:56/127(44%), Gaps:22/127(17%) Query 6 VKRADIATAIEDAVVNAANHRGQVGDGV---------------CRAVAR-KWPQAFRNAA 49 + DI +A+VNAAN G GV C + R KWP+ Sbjct 13 IIEGDITDVNCEAIVNAANPSLMGGGGVDGAIHLKGGKTIDLECAELRRTKWPKGLPPGE 72 Query 50 TPVGTAKTVKCDETYIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRL-SISSVAIPL 108 + + +K Y+IH VGP + E + + ++ YR++ EI ++ I +A P Sbjct 73 ADITSGGKLKAK--YVIHTVGPIYRGQEE-DAETLYSSYYRSL--EIAKIHGIKCIAFPA 127 Query 109 LSTGIFS 115 +STGI+ Sbjct 128 ISTGIYG 134 >RecName: Full=Replicase polyprotein 1a; Short=pp1a; AltName: Full=ORF1a polyprotein; Contains: RecName: Full=Non-structural protein 1; Short=nsp1; AltName: Full=p9; Contains: RecName: Full=Non-structural protein 2; Short=nsp2; AltName: Full=p87; Contains: RecName: Full=Non-structural protein 3; Short=nsp3; AltName: Full=PL1-PRO/PL2-PRO; AltName: Full=PLP1/PLP2; AltName: Full=Papain-like proteinases 1/2; AltName: Full=p195; Contains: RecName: Full=Non-structural protein 4; Short=nsp4; AltName: Full=Peptide HD2; Contains: RecName: Full=3C-like proteinase; Short=3CL-PRO; Short=3CLp; AltName: Full=M-PRO; AltName: Full=nsp5; AltName: Full=p34; Contains: RecName: Full=Non-structural protein 6; Short=nsp6; Contains: RecName: Full=Non-structural protein 7; Short=nsp7; AltName: Full=p5; Contains: RecName: Full=Non-structural protein 8; Short=nsp8; AltName: Full=p23; Contains: RecName: Full=Non-structural protein 9; Short=nsp9; AltName: Full=p12; Contains: RecName: Full=Non-structural protein 10; Short=nsp10; AltName: Full=Growth factor-like peptide; Short=GFL; AltName: Full=p16; Contains: RecName: Full=Non-structural protein 11; Short=nsp11 [Human coronavirus 229E] Sequence ID: P0C6U2.1 Length: 4085 Range 1: 1282 to 1420 Score:42.0 bits(97), Expect:0.012, Method:Composition-based stats., Identities:42/152(28%), Positives:64/152(42%), Gaps:27/152(17%) Query 5 AVKRADIATAIE----DAVVNAANHRGQVGDGVCRAVARKWPQAFRNAA---------TP 51 A + D+ T + D +VNAAN G G+ +A+ + + Sbjct 1282 
AFYQGDVDTVVNGVDFDFIVNAANENLAHGGGLAKALDVYTKGKLQRLSKEHIGLAGKVK 1341 Query 52 VGTAKTVKCDETYIIHAVGPNFNNTSEAEGDRD-LAAAYRAVAAEINRLSISSVAIPLLS 110 VGT V+CD I + VGP + + +RD L AY + E + P+LS Sbjct 1342 VGTGVMVECDSLRIFNVVGPR-----KGKHERDLLIKAYNTINNE-----QGTPLTPILS 1391 Query 111 TGIFSAGKDRVHQSLSHLLAAMDTTEARVTIY 142 GIF ++ SL LL +T E +V +Y Sbjct 1392 CGIFGI---KLETSLEVLLDVCNTKEVKVFVY 1420 >RecName: Full=Replicase polyprotein 1ab; Short=pp1ab; AltName: Full=ORF1ab polyprotein; Contains: RecName: Full=Non-structural protein 1; Short=nsp1; AltName: Full=p9; Contains: RecName: Full=Non-structural protein 2; Short=nsp2; AltName: Full=p87; Contains: RecName: Full=Non-structural protein 3; Short=nsp3; AltName: Full=PL1-PRO/PL2-PRO; AltName: Full=PLP1/PLP2; AltName: Full=Papain-like proteinases 1/2; AltName: Full=p195; Contains: RecName: Full=Non-structural protein 4; Short=nsp4; AltName: Full=Peptide HD2; Contains: RecName: Full=3C-like proteinase; Short=3CL-PRO; Short=3CLp; AltName: Full=M-PRO; AltName: Full=nsp5; AltName: Full=p34; Contains: RecName: Full=Non-structural protein 6; Short=nsp6; Contains: RecName: Full=Non-structural protein 7; Short=nsp7; AltName: Full=p5; Contains: RecName: Full=Non-structural protein 8; Short=nsp8; AltName: Full=p23; Contains: RecName: Full=Viral protein genome-linked nsp9; AltName: Full=Non-structural protein 9; Short=nsp9; AltName: Full=RNA-capping enzyme subunit nsp9; Contains: RecName: Full=Non-structural protein 10; Short=nsp10; AltName: Full=Growth factor-like peptide; Short=GFL; AltName: Full=p16; Contains: RecName: Full=RNA-directed RNA polymerase nsp12; Short=Pol; Short=RdRp; AltName: Full=nsp12; AltName: Full=p100; Contains: RecName: Full=Helicase; Short=Hel; AltName: Full=nsp13; AltName: Full=p66; AltName: Full=p66-HEL; Contains: RecName: Full=Exoribonuclease; Short=ExoN; AltName: Full=nsp14; Contains: RecName: Full=Uridylate-specific endoribonuclease; AltName: Full=NendoU; AltName: Full=nsp15; AltName: Full=p41; Contains: RecName: 
Full=Putative 2'-O-methyl transferase; AltName: Full=nsp16 [Human coronavirus 229E] Sequence ID: P0C6X1.1 Length: 6758 Range 1: 1282 to 1420 Score:42.0 bits(97), Expect:0.013, Method:Composition-based stats., Identities:42/152(28%), Positives:64/152(42%), Gaps:27/152(17%) Query 5 AVKRADIATAIE----DAVVNAANHRGQVGDGVCRAVARKWPQAFRNAA---------TP 51 A + D+ T + D +VNAAN G G+ +A+ + + Sbjct 1282 AFYQGDVDTVVNGVDFDFIVNAANENLAHGGGLAKALDVYTKGKLQRLSKEHIGLAGKVK 1341 Query 52 VGTAKTVKCDETYIIHAVGPNFNNTSEAEGDRD-LAAAYRAVAAEINRLSISSVAIPLLS 110 VGT V+CD I + VGP + + +RD L AY + E + P+LS Sbjct 1342 VGTGVMVECDSLRIFNVVGPR-----KGKHERDLLIKAYNTINNE-----QGTPLTPILS 1391 Query 111 TGIFSAGKDRVHQSLSHLLAAMDTTEARVTIY 142 GIF ++ SL LL +T E +V +Y Sbjct 1392 CGIFGI---KLETSLEVLLDVCNTKEVKVFVY 1420 >RecName: Full=O-acetyl-ADP-ribose deacetylase; AltName: Full=Regulator of RNase III activity [Salmonella enterica subsp. enterica serovar Agona str. SL483] Sequence ID: B5F961.1 Length: 179 Range 1: 7 to 127 Score:40.0 bits(92), Expect:0.013, Method:Compositional matrix adjust., Identities:40/124(32%), Positives:54/124(43%), Gaps:17/124(13%) Query 6 VKRADIATAIEDAVVNAANHRGQVGDGVCRAVARKWPQAFRNAATPV----GTAKT---- 57 V + DI DA+VNAAN G GV A+ R A +A + G +T Sbjct 7 VIQGDITQLSVDAIVNAANASLMGGGGVDGAIHRAAGPALLDACKLIRQQQGECQTGHAV 66 Query 58 ----VKCDETYIIHAVGPNFNNTSEAEGDRDLAAAYR--AVAAEINRLSISSVAIPLLST 111 K +IH VGP + E + L AAYR + AE N S+A P +ST Sbjct 67 ITPAGKLSAKAVIHTVGPVWRGGEYQEAEL-LEAAYRNCLLLAEANHF--RSIAFPAIST 123 Query 112 GIFS 115 G++ Sbjct 124 GVYG 127 >RecName: Full=Macro domain-containing protein TTE0995 [Caldanaerobacter subterraneus subsp. 
tengcongensis MB4] Sequence ID: Q8RB30.1 Length: 175 Range 1: 18 to 127 Score:40.0 bits(92), Expect:0.014, Method:Compositional matrix adjust., Identities:33/111(30%), Positives:48/111(43%), Gaps:13/111(11%) Query 17 DAVVNAANHRGQVGDGVCRAVARKWPQAF---------RNAATPVGTAKTVKCDE---TY 64 DA+VNAAN G GV A+ + A + P G A Y Sbjct 18 DAIVNAANSSLIGGGGVDGAIHKAGGPAIAEELKVIREKQGGCPTGHAVITGAGNLKAKY 77 Query 65 IIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFS 115 +IHAVGP + + E D LA+AY + ++ ++A P +STG + Sbjct 78 VIHAVGPIWKGGNHNE-DNLLASAYIESLKLADEYNVKTIAFPSISTGAYG 127 >RecName: Full=Macro domain-containing protein in gbd 3'region; AltName: Full=ORF2 [Cupriavidus necator] Sequence ID: Q44020.1 Length: 173 Range 1: 8 to 133 Score:39.7 bits(91), Expect:0.016, Method:Compositional matrix adjust., Identities:40/127(31%), Positives:54/127(42%), Gaps:13/127(10%) Query 6 VKRADIATAIEDAVVNAANHRGQVGDGVCRAVARKWPQAFRNA---------ATPVGTAK 56 V DI DA+VNAAN G GV A+ A + A P G A Sbjct 8 VVHGDITRMEVDAIVNAANSGLLGGGGVDGAIHGAGGSAIKEACRAIRDTQGGCPTGEAV 67 Query 57 TVKCDET---YIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGI 113 Y+IHAVGP + + E D LA AYR + + +A P +STGI Sbjct 68 ITTGGHLPAPYVIHAVGPVWQGGDQGE-DELLANAYRNSIRLAAQHHLRRLAFPNISTGI 126 Query 114 FSAGKDR 120 ++ ++R Sbjct 127 YAFPRER 133 >RecName: Full=Macro domain-containing protein mll7730 [Mesorhizobium japonicum MAFF 303099] Sequence ID: Q985D2.1 Length: 176 Range 1: 10 to 130 Score:40.0 bits(92), Expect:0.017, Method:Compositional matrix adjust., Identities:40/122(33%), Positives:54/122(44%), Gaps:9/122(7%) Query 6 VKRADIATAIEDAVVNAAN----HRGQVGDGVCRAVARKWPQAFRNA-ATPVGTAKTVKC 60 + DI DA+VNAAN G V + RA R+ R VG AK K Sbjct 10 IHTGDITKLDVDAIVNAANTLLLGGGGVDGAIHRAAGRELEVECRMLNGCKVGDAKITKG 69 Query 61 DET---YIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFSAG 117 + +IIH VGP + + E + LA+ YR+ SVA P +STG++ Sbjct 70 YKLPARHIIHTVGPVWQGGGKGEAEL-LASCYRSSLELAAANDCRSVAFPAISTGVYRYP 128 Query 118 KD 119 KD Sbjct 129 KD 130 >RecName: Full=O-acetyl-ADP-ribose deacetylase; 
AltName: Full=Regulator of RNase III activity [Salmonella enterica subsp. enterica serovar Gallinarum str. 287/91] Sequence ID: B5RBF3.1 Length: 179 Range 1: 7 to 127 Score:39.7 bits(91), Expect:0.017, Method:Compositional matrix adjust., Identities:39/124(31%), Positives:54/124(43%), Gaps:17/124(13%) Query 6 VKRADIATAIEDAVVNAANHRGQVGDGVCRAVARKWPQAFRNAATPV----GTAKT---- 57 V + DI DA+VNAAN G GV A+ R A +A + G +T Sbjct 7 VIQGDITQLSVDAIVNAANASLMGGGGVDGAIHRAAGPALLDACKLIRQQQGECQTGHAV 66 Query 58 ----VKCDETYIIHAVGPNFNNTSEAEGDRDLAAAYRA--VAAEINRLSISSVAIPLLST 111 K +IH VGP + E + L AYR+ + AE N S+A P +ST Sbjct 67 ITPAGKLSAKAVIHTVGPVWRGGEHQEAEL-LEEAYRSCLLLAEANHF--RSIAFPAIST 123 Query 112 GIFS 115 G++ Sbjct 124 GVYG 127 >RecName: Full=Uncharacterized protein PF1536 [Pyrococcus furiosus DSM 3638] Sequence ID: Q8U0P9.1 Length: 183 Range 1: 4 to 181 Score:39.7 bits(91), Expect:0.020, Method:Compositional matrix adjust., Identities:41/179(23%), Positives:72/179(40%), Gaps:27/179(15%) Query 6 VKRADIATAIEDAVVNAANHRGQVGDGV-----------CRAVARKWPQAFRNAATP--V 52 V + DI +A+VNAAN + G GV R R +A R + Sbjct 4 VVKGDITKFRAEAIVNAANKYLEHGGGVAYAIAKAAAGDVREYIRISKEAMREQLGKDWI 63 Query 53 GTAKTV--------KCDETYIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSV 104 + V K Y+IH VGP + + + L A + + L + S+ Sbjct 64 DHGEVVVTPPLQLEKNGVKYVIHTVGPYCGGSWDEDKKSKLKLAILGALKKADELGVKSI 123 Query 105 AIPLLSTGIFSAGKDRVHQSLSHLL-----AAMDTTEARVTIYCRDKTWEQKIKTVLQN 158 A P +S GI+ ++V ++ ++ +A E + +Y ++ +E+ +K V Q Sbjct 124 AFPAISAGIYGCPLEKVVETFVEVVKEFLPSAKSLREVFLVLYSQE-DYEKALKIVGQG 181 >RecName: Full=Protein-ADP-ribose hydrolase; Short=SpyMacroD [Staphylococcus aureus subsp. 
aureus COL] Sequence ID: Q5HIW9.1 Length: 266 Range 1: 89 to 221 Score:40.4 bits(93), Expect:0.021, Method:Compositional matrix adjust., Identities:41/135(30%), Positives:58/135(42%), Gaps:23/135(17%) Query 6 VKRADIATAIEDAVVNAANHR-----------------GQVGDGVCRAVARKWPQAFRNA 48 V + DI T DA+VNAAN R + G V A Q RN Sbjct 89 VWQGDITTLKIDAIVNAANSRFLGCMQANHDCIDNIIHTKAGVQVRLDCAEIIRQQGRNE 148 Query 49 ATPVGTAKTVK---CDETYIIHAVGPNFNNTSEAEGDRD-LAAAYRAVAAEINRLSISSV 104 VG AK + YIIH VGP ++ ++D LA Y + ++ S++ V Sbjct 149 G--VGKAKITRGYNLSAKYIIHTVGPQIRRLPVSKMNQDLLAKCYLSCLKLADQHSLNHV 206 Query 105 AIPLLSTGIFSAGKD 119 A +STG+F+ +D Sbjct 207 AFCCISTGVFAFPQD 221 >RecName: Full=O-acetyl-ADP-ribose deacetylase; AltName: Full=Regulator of RNase III activity [Shigella flexneri 5 str. 8401] Sequence ID: Q0T5Z6.1 Length: 177 Range 1: 7 to 127 Score:39.7 bits(91), Expect:0.022, Method:Compositional matrix adjust., Identities:41/126(33%), Positives:55/126(43%), Gaps:21/126(16%) Query 6 VKRADIATAIEDAVVNAANHRGQVGDGVCRAVARKWPQAFRNAA---------TPVGTAK 56 V + DI D +VNAAN G GV A+ R A +A P G A Sbjct 7 VVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGHAV 66 Query 57 TVKCDE---TYIIHAVGPNFNNTSEAEGDRDLAAAY----RAVAAEINRLSISSVAIPLL 109 + ++H VGP + + E D+ L AY R VAA S +SVA P + Sbjct 67 ITLAGDLPAKAVVHTVGPVWRGGEQNE-DQLLQDAYLNSLRLVAAN----SYTSVAFPAI 121 Query 110 STGIFS 115 STG++S Sbjct 122 STGVYS 127 >RecName: Full=ADP-ribose glycohydrolase MACROD1; AltName: Full=MACRO domain-containing protein 1; AltName: Full=O-acetyl-ADP-ribose deacetylase MACROD1; AltName: Full=Protein LRP16; AltName: Full=[Protein ADP-ribosylaspartate] hydrolase MACROD1; AltName: Full=[Protein ADP-ribosylglutamate] hydrolase MACROD1 [Rattus norvegicus] Sequence ID: Q8K4G6.2 Length: 258 Range 1: 91 to 206 Score:40.0 bits(92), Expect:0.030, Method:Compositional matrix adjust., Identities:33/116(28%), Positives:48/116(41%), Gaps:8/116(6%) Query 8 RADIATAIEDAVVNAANHRGQVGDGVCRAVARKWPQAFRNAATPVGTAKTVKCDET---- 63 R DI DA+VNAAN+ G GV + R + + +T K T 
Sbjct 91 RGDITKLEVDAIVNAANNSLLGGGGVDGCIHRAAGSLLTDECRTLQNCETGKAKITCGYR 150 Query 64 ----YIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFS 115 ++IH VGP A +L + Y + + + SVA P +STG+F Sbjct 151 LPAKHVIHTVGPIAVGQPTASQAAELRSCYLSSLDLLLEHRLRSVAFPCISTGVFG 206 >RecName: Full=Uncharacterized protein Ta1105 [Thermoplasma acidophilum DSM 1728] Sequence ID: Q9HJ67.2 Length: 196 Range 1: 12 to 140 Score:39.3 bits(90), Expect:0.030, Method:Compositional matrix adjust., Identities:38/130(29%), Positives:54/130(41%), Gaps:15/130(11%) Query 5 AVKRADIATAIEDAVVNAANHRGQVGDGVCRAV-ARKWPQ------AFRNAATPVG---- 53 AV+ DI + +A+VNAAN G GV A+ + P+ R P G Sbjct 12 AVEVGDITESDAEAIVNAANSSLMGGGGVDGAIHSAAGPELNGELVKIRRERYPNGLPPG 71 Query 54 ---TAKTVKCDETYIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLS 110 + + ++IIH VGP + E D L +YR+ I +A P LS Sbjct 72 EAVITRGYRLKASHIIHTVGPVWMGGRNGE-DDVLYRSYRSCLDLAREFGIHDIAFPALS 130 Query 111 TGIFSAGKDR 120 TG + DR Sbjct 131 TGAYGFPFDR 140 >RecName: Full=Macro domain-containing protein; AltName: Full=ORF549 [Acinetobacter sp. ED45-25] Sequence ID: Q93SX7.1 Length: 183 Range 1: 8 to 172 Score:39.3 bits(90), Expect:0.031, Method:Compositional matrix adjust., Identities:43/168(26%), Positives:67/168(39%), Gaps:20/168(11%) Query 8 RADIATAIEDAVVNAANHRGQVGDGVCRAVARKWPQAFR---------NAATPVGTAKTV 58 +ADI A+VN+AN G G+ + +K + P G A+ Sbjct 8 QADITAFAVHAIVNSANKSLLGGGGLDYVIHKKAGPLMKEECVRLNQEKGGCPTGQAEVT 67 Query 59 KCDET---YIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIPLLSTGIFS 115 Y+IHAVGP + + E + L AY + N + +V+ P +STG++ Sbjct 68 TAGNLPAKYLIHAVGPRWLDGEHNE-PQLLCDAYSNALFKANEIHALTVSFPCISTGVYG 126 Query 116 -----AGKDRVHQSLSHLLAAMDTTEARVTIYCRDKTWEQKIKTVLQN 158 A + + LS +L D A V CR+ K +L N Sbjct 127 FPPQKAAEIAIGTILS-MLPQYDHV-AEVFFICREDENYLIYKNILSN 172 >RecName: Full=Protein-ADP-ribose hydrolase; Short=SpyMacroD [Staphylococcus aureus subsp. aureus MSSA476] Sequence ID: Q6GCE6.1 Length: 266 >RecName: Full=Protein-ADP-ribose hydrolase; Short=SpyMacroD [Staphylococcus aureus subsp. 
aureus MW2] Sequence ID: Q8NYB7.1 Length: 266 Range 1: 89 to 221 Score:39.7 bits(91), Expect:0.032, Method:Compositional matrix adjust., Identities:41/135(30%), Positives:58/135(42%), Gaps:23/135(17%) Query 6 VKRADIATAIEDAVVNAANHR-----------------GQVGDGVCRAVARKWPQAFRNA 48 V + DI T DA+VNAAN R + G V A Q RN Sbjct 89 VWQGDITTLKIDAIVNAANSRFLGCMQANHDCIDNIIHTKAGVQVRLDCAEIIRQQGRNE 148 Query 49 ATPVGTAKTVKCDE---TYIIHAVGPNFNNTSEAEGDRD-LAAAYRAVAAEINRLSISSV 104 VG AK + YIIH VGP ++ ++D LA Y + ++ S++ V Sbjct 149 G--VGKAKITRGYNLPAKYIIHTVGPQIRRLPVSKMNQDLLAKCYLSCLKLADQHSLNHV 206 Query 105 AIPLLSTGIFSAGKD 119 A +STG+F+ +D Sbjct 207 AFCCISTGVFAFPQD 221 >RecName: Full=O-acetyl-ADP-ribose deacetylase; AltName: Full=Regulator of RNase III activity [Salmonella enterica subsp. enterica serovar Typhimurium str. LT2] Sequence ID: P67341.1 Length: 179 >RecName: Full=O-acetyl-ADP-ribose deacetylase; AltName: Full=Regulator of RNase III activity [Salmonella enterica subsp. enterica serovar Typhi] Sequence ID: P67342.1 Length: 179 Range 1: 7 to 127 Score:38.9 bits(89), Expect:0.034, Method:Compositional matrix adjust., Identities:39/124(31%), Positives:53/124(42%), Gaps:17/124(13%) Query 6 VKRADIATAIEDAVVNAANHRGQVGDGVCRAVARKWPQAFRNAATPV----GTAKT---- 57 V + DI DA+VNAAN G GV A+ R A +A + G +T Sbjct 7 VIQGDITQLSVDAIVNAANASLMGGGGVDGAIHRAAGPALLDACKLIRQQQGECQTGHAV 66 Query 58 ----VKCDETYIIHAVGPNFNNTSEAEGDRDLAAAYR--AVAAEINRLSISSVAIPLLST 111 K +IH VGP + E + L AYR + AE N S+A P +ST Sbjct 67 ITPAGKLSAKAVIHTVGPVWRGGEHQEAEL-LEEAYRNCLLLAEANHF--RSIAFPAIST 123 Query 112 GIFS 115 G++ Sbjct 124 GVYG 127 >RecName: Full=O-acetyl-ADP-ribose deacetylase; AltName: Full=Regulator of RNase III activity [Salmonella enterica subsp. enterica serovar Newport str. 
SL254] Sequence ID: B4T2X8.1 Length: 179 Range 1: 7 to 127 Score:38.9 bits(89), Expect:0.034, Method:Compositional matrix adjust., Identities:39/124(31%), Positives:53/124(42%), Gaps:17/124(13%) Query 6 VKRADIATAIEDAVVNAANHRGQVGDGVCRAVARKWPQAFRNAATPV----GTAKT---- 57 V + DI DA+VNAAN G GV A+ R A +A + G +T Sbjct 7 VIQGDITQLSVDAIVNAANASLMGGGGVDGAIHRAAGPALLDACKLIRQQQGECQTGHAV 66 Query 58 ----VKCDETYIIHAVGPNFNNTSEAEGDRDLAAAYR--AVAAEINRLSISSVAIPLLST 111 K +IH VGP + E + L AYR + AE N S+A P +ST Sbjct 67 ITPAGKLSAKAVIHTVGPVWRGGEHQEAEL-LEEAYRNCLLLAEANHF--RSIAFPAIST 123 Query 112 GIFS 115 G++ Sbjct 124 GVYG 127 >RecName: Full=Protein-ADP-ribose hydrolase; Short=SpyMacroD [Staphylococcus aureus subsp. aureus Mu50] Sequence ID: P67343.1 Length: 266 >RecName: Full=Protein-ADP-ribose hydrolase; Short=SpyMacroD [Staphylococcus aureus subsp. aureus N315] Sequence ID: P67344.1 Length: 266 Range 1: 89 to 221 Score:39.7 bits(91), Expect:0.034, Method:Compositional matrix adjust., Identities:41/135(30%), Positives:58/135(42%), Gaps:23/135(17%) Query 6 VKRADIATAIEDAVVNAANHR-----------------GQVGDGVCRAVARKWPQAFRNA 48 V + DI T DA+VNAAN R + G V A Q RN Sbjct 89 VWQGDITTLKIDAIVNAANSRFLGCMQANHDCIDNIIHTKAGVQVRLDCAEIIRQQGRNE 148 Query 49 ATPVGTAKTVKCDE---TYIIHAVGPNFNNTSEAEGDRD-LAAAYRAVAAEINRLSISSV 104 VG AK + YIIH VGP ++ ++D LA Y + ++ S++ V Sbjct 149 G--VGKAKKTRGYNLPAKYIIHTVGPQIRRLPVSKMNQDLLAKCYLSCLKLADQHSLNHV 206 Query 105 AIPLLSTGIFSAGKD 119 A +STG+F+ +D Sbjct 207 AFCCISTGVFAFPQD 221